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On the Use of Dynamic Time Warping for 
Word Spotting and Connected Word 
Recognition* 


By C. S. MYERS, L. R. RABINER, and A. E. ROSENBERG 
(Manuscript received September 9, 1980) 


Several variations on algorithms for dynamic time warping for 
speech processing applications have been proposed. This paper com- 
pares two of these algorithms, the fixed-range method and the local 
minimum method. We show that, based on results from some simple 
word spotting and connected word recognition experiments, the local 
minimum method performs considerably better than the fixed-range 
method. We describe explanations of this behavior and techniques 
for optimizing the parameters of the local minimum algorithm for 
both word spotting and connected word recognition. 


1. INTRODUCTION 


Time registration of a test and a reference pattern is one of the 
fundamental problems in the area of automatic speech recognition. 
This problem is important because the time scales of a test and a 
reference pattern are not perfectly aligned. In some cases the time 
scales can be registered by a simple linear compression or expansion’”; 
however, in most cases, a nonlinear time warping is required to 
compensate for local compression or expansion of the time scale. For 
such cases, the class of algorithms known as dynamic time warping 
(DTw) methods has been developed. Work by Sakoe and Chiba,’ 


* The work presented here is based, in part, on the MS thesis, ““A Comparative Study 
of Several Dynamic Time Warping Algorithms for Speech Recognition, ” by C.S. Myers, 
MIT, April 1980. 
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Itakura,* and White and Neely” has shown that ptw algorithms are an 
effective method of time registering patterns in isolated word recog- 
nition systems. Bridle’ and Christiansen and Rushforth® have studied 
the applicability of DTw algorithms to word spotting, and recently, 
Sakoe,’ Rabiner and Schmidt,’ and Myers and Rabiner,’ have success- 
fully applied dynamic time-warping techniques to connected digit 
recognition. A great deal of work has been done in the area of 
performance evaluation of the various DTW algorithms as applied to 
discrete word recognition.'”"'” However, the effects of the prw param- 
eters on the overall performance of the algorithm for either word 
spotting or connected word recognition are not as well understood. 
The purpose of this paper is to discuss several proposed methods of 
applying DTW algorithms to word spotting and connected word recog- 
nition, and to study some of the factors which determine the perform- 
ance of these algorithms. 

The organization of this paper is as follows. In Section II we review 
the basic dynamic programming method of time alignment and show 
how it may be used efficiently in either a word spotting or a connected 
word recognition problem. We describe, in detail, two different DTw 
algorithms for which we have performed extensive evaluations. Section 
III contains a description of the experiments which we performed to 
evaluate the performance of the different DTW algorithms and the 
effects of the parameters associated with them. In Section IV we 
summarize the results of these experiments and draw some general 
conclusions on the use of DTW algorithms for word spotting and 
connected word recognition. 


ll. DYNAMIC PROGRAMMING FOR TIME ALIGNMENT 


In this section we first review the basic principles of DTW algorithms 
as applied to discrete word recognition, and then point out some of the 
inherent difficulties involved in applying these algorithms to word 
spotting and connected speech recognition. We then show how it is 
possible to modify the basic DTw idea so that it may be used for both 
connected word recognition and word spotting applications. 


2.1 Dynamic time warping for discrete word recognition 


The problem of time alignment for discrete word recognition is 
illustrated in Fig. 1. A reference pattern, R(n), n = 1, 2, ---, N, 
consisting of a time sequence (ie., frames) of a multidimensional 
feature vector is to be time registered with a test pattern, T(m), m = 
1, 2, ---, M, which is also represented as a time sequence of a 
multidimensional feature vector. In Fig. 1, for the sake of clarity, both 
R(n) and T(m) are shown as one-dimensional functions. We shall 
assume that both the reference and the test pattern are measured from 
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Tim) 





(i(k), y(k)) 


R{n) 


1 N 


Fig. 1—Time warping of a reference and a test pattern. 


the acoustic waveform of a single word, spoken in isolation, and that 
both the beginning and ending points of the reference and the test 
pattern have been accurately determined. The problem of time align- 
ment is to find the path, here parameterized by the function pair (i(k), 
J(k)), which minimizes a given distance metric. A typical distance 
metric” is of the form 


K 
» Auk), 7(k)) W(R) 
D(i(k), j(k)) = ————____, (1) 
N(W) 
where K is the length of the path, d(i(k), 7(R)) is the local distance, or 
dissimilarity, between frame i(k) of the reference pattern and frame 
J(k) of the test pattern, W(k) is a weighting function applied to the 
path, and N(W) is a normalization factor which is based on the 
particular weighting function that is chosen. 

In addition to minimizing the global distance, the time alignment 
path is chosen to have certain desirable properties. One important 
property is the proper time registration of the beginning and ending 
points of the test and reference patterns, Le., 


* D is shown here as a functional of the path function pair (1(R), 7 (k)). 
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(iy=1, sl) =1, (2a) 
UK)=N, j(K)=M. (2b) 


Also, the time alignment path is required to obey certain shape and 
slope constraints. For example, it would not be reasonable to allow a 
path for which a 10 to 1 expansion or compression of the time axis 
occurs. Another consideration is the preservation of time order, 1.e., 
the functions i(k) and 7(2) must both be monotonically increasing. 
These local continuity constraints are generally described by speci- 
fying the full path in terms of simple local paths which may be pieced 
together to form larger paths. For example, to reach a grid print (n, m) 
it may be reasonable to have come from any of the grid points (n — 1, 
m — 1), (n — 1, m — 2), or (n — 2, m — 1), as Shown in Fig. 2, part a. We 
refer to these constraints as Type I local constraints. Some other 
proposed sets of local constraints are shown in parts b, c, and d of Fig. 
2. The crossed out arc in part d signifies the restriction that a path 
may not move horizontally for two consecutive segments.* All these 
local constraints limit the overall slope of the time alignment contour 


moO oO mo 
m—1 oO m-1 
m—-20 O m-2 0 @) 
n-2 n—1 n n—2 n—1 n 
TYPE | TYPE OU 
(a) (b) 
mo m 
m—1 O m—10 Oo 
m—2 O m—-2 0 oO 
n—2 n—1 n n—2 n—1 n 
TYPE Ol ITAKURA 


(c) (d) 


Fig. 2—Local constraints used for dynamic time warping. 
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to be between % and 2, in accordance with the results found by Sakoe 
and Chiba.” 

To solve for the optimal time-alignment path, both the weighting 
function, W(z), and the normalization factor, N(W), must be specified 
in addition to the local constraints. Typically W(k) is chosen to be 
either of two functions, 1.e., 


W(k) = i(k) — i(k — 1) (Type a), (3a) 
Wk) = i(k) — i(k — 1) + J(R) — F(R —- 1) (Type b). = (3b) 


These two weighting functions are referred to as the asymmetric 
weighting function, Type a, and the symmetric weighting function, 
Type b, and were originally proposed by Sakoe and Chiba.’ Weighting 
function Type a weights all frames of the reference pattern equally, 
while weighting function Type b weights all frames of both the refer- 
ence and the test equally. For initialization purposes, 1(0) and 7(0) are 
defined to be 0 and thus W(1) = 1 for weighting function Type a and 
W(1) = 2 for weighting function Type b. 

The choice of N(W) is typically made such that D(i(k), j(R)) is the 
average local distance along the path defined by i(k) and 7(R), and is 
independent of both the lengths of the reference and test patterns, as 
well as the length of the time alignment path itself. The natural choice 
for N(W) is thus 

K 


N(W) = ¥ Wa). (4) 
k=1 
For weighting functions Types a and b the normalization is given by 


K 
N(W,) = 2 (i(k) — UR — 1)) = UK) — 10) = N, (5a) 


K 
N(W) a? (7(k) — uk — 1) + (Rk) — J(k — 1) 


= 1K) —1(0) + J(K) —-7(0) = N+M. (5b) 


Given a weighting function and a set of local constraints it is possible 
to define the optimal time-alignment path as that path which mini- 
mizes the total distance D(i(k), 7(k)). More formally, if we denote the 
distance associated with the optimal path as D, then 

D= min [D(i(k), j(R))]. (6) 
K,i(k),j(R) 

The soiution to this problem may be found by dynamic programming 
by use of the following optimality principle: 

Local Optimality: If the best path from the grid point (1, 1) to the 
grid point (7, m) goes through a grid point (n’, m’), then the best path 
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from the grid point (1, 1) to the grid point (n, m) includes, as a portion 
of it, the best path from the grid point (1, 1) to the grid point (n’, m’). 

Thus, if we define Da(n, m) as the minimum total distance along 
any path from the grid point (1, 1) to the grid point (n, m), then Da(n, 
m) can be computed, recursively according to the optimality principle, 
as 


Da(n, m) = min [Da(n’, m’) + d((n’, m’), (n, m))), (7) 
where d((n’, m’), (n, m)) is the weighted distance from the grid point 
(n’, m’) to the grid point (n, m). For example, for Type I local 


constraints and an asymmetric weighting function, n’ and m’ may take 
on any of the following values, 


(n’,m’) € {(n — 1, m— 1), (n-—1,m— 2), (n - 2,m—1)} (8) 


and d((n’, m’), (n, m)) is given by 


d((n — 1, m — 1), (n, m)) = d(n, m), (9a) 
d((n — 1, m — 2), (n, m)) = d(n, m), (9b) 
d((n ~~ 2, m — 1), (n, m)) = 2d(n, m). (9c) 


Thus the full DTw recursion for Type I local constraints and weighting 
function Type a is given by 


Da(n, m) = min[Da(n - 1, m — 1) + d(n, m), Da(n — 1, m — 2) 
+ d(n, m), Da(n — 2,m — 1) + 2d(n, m)]. (10) 


Using the local optimality principle, a complete DTw algorithm is given 
by the algorithm 


Step 1. Initialize Da(1, 1) = d(1, 1) W(1). 
Step 2. Compute Da(n, m) recursively forlsn=N, ls=m<M. 
Step 3. D=Da(N, M)/ N(W). | 


This completes our review of the basic principles involved in apply- 
ing dynamic programming to discrete word recognition. We will now 
describe the difficulties which arise when DTW algorithms are applied 
to connected word recognition problems and then we will show how 
the DTW principle can be modified for word spotting and connected 
word recognition applications. 


2.2 Difficulties in connected word recognition 


We shall assume that we are given a test pattern consisting of a 
sequence of connected words, spoken in a normal manner, for which 
the global beginning and ending points have been accurately located 
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and for which no further segmentation has been attempted. Given 
such a framework, the word spotting problem is to determine all 
subsections of the test pattern, if any, which match with a specified 
reference pattern, called the keyword. Thus, for word spotting a 
multiplicity of regions of the test pattern must be compared with the 
keyword pattern. 

The connected word recognition problem, on the other hand, is to 
piece together reference patterns (obtained, in all our work, from 
isolated occurrences of words) to match the test pattern. The general 
approach to this problem will be the one proposed by Levinson and 
Rosenberg,’® namely: 

(zt) Find the reference pattern that best fits a given section of the 
test pattern. 

(ut) Use the position within the test pattern at which the best match- 
ing word ends to postulate the beginning of the following word. 

(zzz) Continue to concatenate reference patterns in this manner until 
the test pattern is exhausted. 

Dynamic time-warping algorithms, as they have been applied to 
discrete word recognition applications, are not directly applicable to 
either the word spotting or the connected word recognition problem. 
There are two reasons why this is so. Figure 3 illustrates some of the 
problems which are encountered. In this figure we show the time 


CONSECUTIVE ISOLATED 
WORDS 


ENERGY 
ENVELOPE 





(b) CONNECTED WORDS 


ENERGY 
ENVELOPE 


1 N\A_s OF ht He Pe tel 3 Jee 


"38" 
TIME ———- 


Fig. 3—Log energy for two speech utterances. 
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pattern for log intensity of two speech utterances, “3,” “8” in part a, 
and “38” in part b. The utterance in part a was spoken with a 
discernible pause between the “3” and the “8,” while the utterance in 
part b was spoken with no discernible pause between the “3” and the 
“8.” Dynamic time-warping algorithms, as they have been applied to 
discrete word recognition, require a reliable set of word boundaries. 
However, as seen in Fig. 3b, a reliable segmentation for the utterance 
“38” is difficult, if not impossible, to obtain. 

Another difficulty in using DTW algorithms, based on isolated word 
reference templates, for connected speech applications is the problem 
of coarticulation between words. For example, the final /i/ of the word 
“3” and the initial /e’/ of the word “8” coarticulate strongly with each 
other. Thus, another fundamental assumption that has been relied on, 
namely that the characteristics of the isolated reference words which 
we are trying to match to our test utterance can be truly found in the 
test pattern, is not valid. In the next section we will describe the basic 
techniques that will be used to overcome these difficulties. 


2.3 Basic approaches to connected speech recognition problems 


In our approach to connected word recognition and word spotting 
we will make two changes from the structure of the isolated word Drw 
algorithm. One change is to no longer attempt to find the entire 
isolated reference pattern in the test pattern. We will still use isolated 
words as our reference patterns but will only expect a good match in 
the middle of the word, and not necessarily near the ends. Thus, we 
will not require that we be able to accurately match the beginning and 
ending points of the reference pattern to points within the test pattern. 
As a result, we would like to consider the possibility of overlapping 
reference patterns to recognize connected speech. In this manner we 
hope to account for both errors in the endpoint locations and for some 
of the gross features of coarticulation. 

Another fundamental modification to the basic DTW algorithm is the 
use of beginning and ending regions rather than beginning and ending 
frames. In this manner we hope to avoid some of the problems inherent 
in requiring an accurate segmentation of the test utterance. Figure 4 
defines, within a test pattern, a beginning region of size B (frames), 
with potential starting frames between 0, and 62 (B = b2 — b; + 1), and 
an ending region of size FE, with potential ending frames between e; 
and e2 (H = es — e; + 1). One possible DTW constraint would be that 
the best time-alignment contour may begin anywhere within the 
beginning region and end anywhere within the ending region. Three 
such potential paths are shown in Fig. 4. Such a framework would be 
used for word spotting, in which the beginning and ending regions 
correspond to the entire test pattern, or for connected word recogni- 


310 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1981 


ENDING 
REGION 








WARPING 
FUNCTIONS 





TEST 


BEGINNING 
REGION 

| 

| 





REFERENCE 


Fig. 4—Illustration of the use of beginning and ending regions. 


tion, in which the ending region for one word is used to hypothesize 
the beginning region for the next word. 

The use of beginning and ending regions modify the basic pTw 
algorithm by changing the constraints which are imposed on the ends 
of the time-alignment contour, 1.e., 


(1) = 1, JQ) =8, b,=bs by, (11a) 
(Kk) = WN, J(K) =e, ej: ee. (11b) 


Thus, to find the optimal time-alignment contour, every possible 
beginning and ending point pair must be tried, that is, 


a 


min min min LDU), J(k)) s.t. 7(1) = b, J(K) = ai]. (12) 


b)<b<b, é\;=e=eo K,tth) J (R 


The amount of computation required to solve eq. (12) for the optimal 
path can be excessive, i.e., theoretically we require B - F separate time 
warps in the most general case. However, the amount of computation 
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required to solve eq. (12) may be reduced to a single time warp by 
judicious selection of the weighting function. If W(%) is chosen to be 
the asymmetric weighting function, Type a (W.(k) = i(k) — i(k — 1)), 
and N(W) is chosen appropriately (N(W,) = N), then D may be 
computed efficiently by a modified DTW algorithm as follows: 


Step 1. Set Da(1, 6) =d(1, 6) for b= 68 bo, 
Step 2. Compute Da(n, m) recursively for 1 <n s N, 
b; =mM= eo, 


Step 3. D= x min [Da(N, e)]. 


e;=eseo 


This algorithm works because Step 1 initializes all possible beginning 
points, Step 2 computes the best path to a point (n, m) from any of the 
potential beginning points initialized in Step 1, and Step 3 finds the 
best possible ending point along any path from any possible beginning 
point. The particular choice of the asymmetric weighting function is 
important because its normalization factor is unaffected by the choice 
of the beginning or ending poits, 1.e., its normalization factor is always 
N. A dependence on the length of the test pattern, as in the symmetric 
weighting function, Type b, would require a separate time warp for 
each set of beginning and ending points because the effective length of 
the test (e — b + 1) depends on the choice of the beginning and ending 
points. 

An important factor, even with the savings of a single time warp, is 
the large amount of computation required for the Dtw algorithm. Step 
2 of the modified DTW algorithm is defined for 1 =n =N, b; = mS eo 
and this region may be as large as N-M. It is also not possible to 
significantly reduce this size by using restrictions on the slope of the 
warping contour when the ending region is left unspecified. This point 
is illustrated in Fig. 5, where the slope of the warping function 
is restricted to be between % and 2. We observe that, even with 
this restriction, when no ending region is specified, the area for which 
Da(n, m) must be computed is 4N? + B-N. 

Two modifications to the DTW algorithm have been suggested to 
reduce this amount of computation. In particular, Sakoe and Chiba? 
have proposed that a time-warping path not be allowed to deviate 
significantly from a straight line, 1.e., for any i(k), the value of 7(k) is 
restricted such that 


i(k) -i(k) -B+1/=R, (13) 


where 0 is the center of the beginning region [6 = (b; + b2)/2] and R 
is the maximum deviation which is allowed. R must be chosen to at 
least cover the entire beginning region, i.e., 2R + 1=>B. This algorithm 
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REFERENCE 


Fig. 5—Region of the (n, m) plane which is examined in a time warp for which no 
ending region is specified. 


will be referred to as the fixed range DTW algorithm and is illustrated 
in Fig: 6a. Another range-reduction technique, proposed by Rabiner, 
Rosenberg, and Levinson”® and described in detail by Rabiner and 
Schmidt® is shown in Fig. 6b. Here 7(k) is restricted to be within a 
fixed range about the best path so far, that is, the local minimum 
Formally, we have 


|7(k) — c(k)| «, (14a) 
c(k) = argmin[Da(u(k) — 1, m)], - (14b) 
c(1) = 6, : ~ (14c) 
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Fig. 6—Illustration of the fixed range and the local minimum Drtw algorithms. 


where c(k) is the position, in the vertical direction, of the local 
minimum of D,(i(k) — 1, m), and ¢ is the allowable range about this 
local minimum. Thus, if Da(n, m) is computed in successive vertical 
strips, i.e., 7 is fixed and m is varied, then the range of one vertical 
strip is te about the local minimum of the previous vertical strip. This 
algorithm is referred to as the local minimum DTW algorithm. 
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Two fundamental differences exist between these two algorithms. 
The fixed range DTW algorithm, a priori, specifies the ending region 
from the specification of the beginning regions, 1.e., 


E =2R +1, (15a) 
e=b+N-R, (15b) 
eo=b+N+R, (15c) 


while the local minimum DTW algorithm defines the ending region 
implicitly from the local minimum of the last vertical strip, 1.e., 


E = 2¢€ +1, (16a) 
ei: =c(K)—-e«, (16b) 
eg=c(K) +. (16c) 


The other fundamental difference between the two time-warping 
algorithms involves the number of time warps required to cover a 
beginning region. For the fixed range DTW algorithm the entire begin- 
ning region is most efficiently covered in a single time warp with 
2R + 1=B, rather than several smaller time warps, because overlap- 
ping time warps may be merged together without loss of accuracy. 

However, an analogous specification of the local minimum time- 
warping algorithm (2e + 1 = B) may not be truly optimal. Since one 
application of the local minimum DTw algorithm may follow only one 
local minimum path, erroneous decisions may be made because the 
true path may be “lost,” 1.e., the globally best path may not be within 
e frames of the locally best path. As such, it may be better to try 
several smaller local-minimum time warps, thus allowing several dif- 
ferent local-minimum paths to be tried, and to compare the results of 
these paths to determine the overall “best’”’ path. Such a procedure is 
illustrated in Fig. 7. We assume that NTRY local minimum time warps 
are to be computed. Each time warp has (about its respective local 
minimum) a local range of +e and the centers of two adjacent time 
warps are initially separated by 5. The entire region covered by the 
NTRY time warps is given by 


A= 2e+1+(NTRY — 1)-6. (17) 


To cover the entire beginning region, NTRY, ¢ and 6 are chosen so 
that A = B. 

In the next section of this paper we describe experiments designed 
to measure the relative strengths and weaknesses of the fixed range 
and the local minimum DTW algorithms and also to determine reason- 
able choices for the parameters 6, e, and NTRY for both word spotting 
and connected word recognition applications. 
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Fig. 7—Illustration of the parameters of the local minimum DTw algorithm. 


lil. EXPERIMENTS IN DYNAMIC TIME WARPING FOR CONNECTED 
SPEECH RECOGNITION 


This section presents the results of experiments designed to compare 
the fixed range and the local minimum DTw algorithms. We also 
describe the results of several experiments designed to study the 
parameters of the local minimum algorithm. Finally, we show how 
these results may be applied to the problems of word spotting and 
connected word recognition. 


3.1 Comparison of the time warping algorithms 


In our initial experiment the recognition accuracies achieved by 
both the fixed range and the local minimum DTw algorithms for a 
modified zsolated word recognition problem are compared. The test 
utterances consisted of 54 words from a vocabulary of computer terms, 
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spoken by each of 4 talkers, for a total of 216 utterances. The test 
utterances were recorded over a dialed-up telephone line, band-limited 
to 3.2 kHz, digitized at 6.67 kHz, and analyzed every 15 ms with an 
eighth-order LPC analysis using a 45-ms window (i.e., successive frames 
overlapped by 30 ms). Local distance scores, d(i(k), 7(k)),were calcu- 
lated using Itakura’s log likelihood ratio.* The reference patterns 
consisted of two templates per word of the vocabulary formed by a 
speaker-independent clustering technique."** 

To evaluate the relative performance of the two DTW algorithms the 
test utterances were modified so that a beginning region could be 
specified as some range about the true beginning point. No ending 
region was specified. For the sake of comparison, R and € were both 
set equal to eight frames} and NT'RY was set to one. Figure 8 shows 
the recognition results for both algorithms as a function of the four 
different local constraints (used in the DTW algorithms) defined in 
Section 2.1. We observe that the local minimum pDtw algorithm per- 
formed better than the fixed range DTW algorithm for all local con- 
straints. 

In another comparison we generated ten pseudo-connected test 
sequences by artificially embedding (at an arbitrary frame) an isolated 
digit into a connected:digit sequence, both uttered by the same talker. 
We then used both DTw algorithms to “spot” the embedded digit using 
two speaker-dependent templates per digit. The parameters of the two 
DTW algorithms that were used were the same ones as in our initial 
experiment (e€ = 8, R = 8). To spot the embedded digit, every possible 
beginning region of size 2e + 1 (= 2R + 1) was tried. The number of 
times that the DTW algorithm found the (correct) best path (as deter- 
mined by the lowest overall distance achieved by any beginning region) 
was recorded. We also recorded the ending point of the embedded 
word, as estimated by the word spotting procedure. Results showed 
that both the local minimum and the fixed range DTw algorithms were 
able to locate the endpoint of the embedded word with a high degree 
of accuracy. (The average error between the true ending frame and 
the estimated ending frame was 1.2 frames for both DTW algorithms.) 

Figure 9 shows the relative performance of the two DTw algorithms 
for this simple word spotting experiment. These figures plot the 
number of times that the particular DTW algorithm found the proper 
path (as determined by the lowest-distance score achieved) for each of 


“The speaker-independent reference template set was a subset of the 12 template 
per word set used in Ref. 14. This modification was used to reduce computation (and 
hence reduce accuracy somewhat). For the purpose of our experiments (i.e., the relative 
comparison of the fixed range and the local minimum DTw algorithms) this modification 
was of little consequence. 


+ Setting R and ¢€ equal is a fair comparison of the two methods since the computation 
is the same for both methods. 
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FIXED RANGE ({R = 8) 


LOCAL MINIMUM (€ = 8) 


ERROR RATE IN PERCENT 





I U soul ITAKURA 
LOCAL CONSTRAINTS 


Fig. 8—Results for word recognition using both the fixed range and the local minimum 
DTW algorithms. 


the ten embedded digits. We observe from Fig. 9 that the local 
minimum DTW algorithm found the best path more often than the 
fixed range DTW algorithm for almost all digits. 

We also observe that the local minimum algorithm was able to find 
the best path 17 times (the maximum number possible, 2e + 1) for 8 of 
the 10 digits, while the fixed range algorithm never achieved this 
accuracy. 


TYPE I LOCAL CONSTRAINTS 


O 3 O O 
LOCAL MINIMUM (€ = 8) 


COUNT 


FIXED 


RANGE (R=8) © 





0 1 2 3 4 5 6 7 8 9 
EMBEDDED DIGIT 


Fig. 9—Results for word spotting using both the fixed range and the local minimum 
DTW algorithms. 
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The results of these two sample experiments showed that the local 
minimum DTW algorithm performed consistently better than the fixed- 
range DTW algorithm. In the next section we describe experiments 
designed to more fully study some of the parameters of the local- 
minimum time-warping algorithm. 


3.2 Examination of the parameters of the local-minimum dynamic time- 
warping algorithm | 

To understand the effects of the various combinations of the param- 
eters A, 6, NTRY, and ¢€ on the performance of the local minimum 
DTW algorithm, a series of connected digit-recognition experiments was 
performed. A total of 80 strings of from 2 to 5 connected digits each 
(20 strings of each length) were recorded by each of the two talkers. 
These strings were the same as those used by Rabiner and Schmidt.® 
In the recognition task we used two speaker-dependent templates per 
digit. The first step in the experiment was to “spot” the ending point 
of the first digit in each string via a local-minimum algorithm (e = 11, 
NTRY = 1) using the known beginning point of the first digit. Then 
an attempt was made to recognize the second digit in the string. 
Because of inaccuracies in “spotting” the ending point of the first digit, 
and because of coarticulation effects, it was not possible to precisely 
determine the beginning point of the second digit, and, as such, a 
beginning region for the second digit was centered around the ending 
frame of the first digit, as determined by the “spotting” procedure. 
The best candidate for the second digit was chosen as that template 
which achieved the lowest overall average distance, regardless of where 
it ended. Several values of «, 6, A, and NTRY were used and the 
accuracies and distance scores for the recognition of the second digit 
were recorded. | 

Figure 10 shows, for a large value of A (27 in this case), the average 
best distance score for all NTRY time warps as a function of 6, for 
several values of «. Two curves are shown in each part of the figure. 
The solid curve is the case when the reference word is the same as the 
second word in the test strings. The dashed curve represents the case 
in which the reference is different from the second word in the test 
string. Examination of Fig. 10 shows that the average best distance for 
both “same words” and “different words” increases as 6 increases. 
However, we observe that when the reference is different from the 
second digit in the test utterance (i.e., the dashed curves), the average 
distance generally increases as 6 increases, but, when the reference and 
the test words are the same (i.e., the solid curves), the average best 
distance is constant for small values of 6 and increases only beyond 
the critical value 6 = 2e + 1. This critical value, 6 = 2e + 1 (shown by 
a caret in the scales of Fig. 10), is a particularly important value of 6 
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because for 6 < 2e + 1, consecutive time warps overlap in their 
beginning regions, and for 6 > 2e + 1 there are frames between two 
consecutive time warps which are not covered by either beginning 
region. When 6 = 2e + 1, we have the case where there is no overlap 
in adjacent beginning regions and no skipped frames between these 
regions. From the results shown in Fig. 10 we conclude that, on 
average, there is no loss in performance in the local-minimum DTw 
algorithm as long as no potential beginning frames are skipped, 1.e., as 
long as 6 S 2e + 1. 

One explanation of why 6 may be taken as large as 2e + 1, 1.e., no 
overlapping of beginning regions, without an appreciable loss of accu- 
racy, is shown in Fig. 11. Here we show the progress of a set of typical 
paths in which the starting regions overlap. By the nature of the local- 
minimum DTW algorithm, best paths from overlapping time warps tend 


f 


BEST DISTANCE 





A=27 ITAKURA‘S LOCAL CONSTRAINTS 





SAME WORD 
——— DIFFERENT WORDS 


Fig. 10—Distance scores for the local minimum DTW algorithm as applied to con- 
nected digit recognition. 
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Fig. 11—Illustration of path merging for two adjacent local-minimum time warps. 


to merge if there is a good path common to both of their beginning 
regions. Figure 12 shows the effects of path merging (of the local 
minimum DTW algorithm) on the digit recognition accuracies. Here we 
plot the recognition error rate for the second digit in the test sequences 
as a function of 6 for various values of «. We see that, for a fixed e, it 
is possible to increase 6 with essentially no loss in accuracy as long as 
6 <= 2e + 1.” 

Figure 12 also shows that ¢€ = 6 provides the minimum error rate. It 
is reasonable to expect that as € 1s made too small, good paths may 
easily become lost; but as € is made too large, incorrect paths may 
start to generate low scores and thus cause errors. Thus, a finite value 
of € is probably optimum. Unfortunately, such a value will have to be 
determined for each application. 

Another interesting effect on recognition accuracy for various com- 
binations of ¢«, 6, A, and NTRY is shown in Fig. 13. Here we plot 
recognition error rates for the second digit of our test utterances for 
two cases, namely « = (A — 1)/2 (NTRY = 1), and for the best 
combination of ¢«, 6, and NTRY (as determined by the lowest-recog- 


* Note that for A fixed, the largest possible 6 is 6 = A — 2e — 1 (NTRY = 2) so that 
the curves for the various values of € in Figure 12 are defined only for those values of 5 
such that 6 = A — 2e — 1. 
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ERROR RATE IN PERCENT 





Fig. 12—Digit error rate for connected digit recognition using the local minimum DTW 
algorithm and several values of € and 6. 


nition error rate). We see that, for smaller values of A, a single warp 
performs as well as any combination of ¢, 6, and NTRY, and as A 
increases, the differences in error rates between the best possible e, 6, 
and NTRY combination and a single warp remains less than 2.5%. 
Thus, it might be possible to perform some type of connected word 
recognition using only a single local-minimum time warp per word. In 
the next section we describe how the results of our experiments have 
actually been applied to both word spotting and connected word 
recognition applications. 


9 
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A 


Fig. 13—Digit error rates for connected digit recognition using the local minimum 
DTW algorithm. 
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3.3 Application of DTW algorithms to word spotting and connected word 
recognition 


We have shown that, both for connected word recognition and word 
spotting applications, the local minimum DTW algorithm performs 
consistently better than the fixed range DTW algorithm. We have also 
shown that, given a value of «, 6 may be chosen as large as 6 = 2e + 1 
without significant degradation in the performance of the local mini- 
mum DTW algorithm. Since, for a fixed beginning region (i.e., a fixed 
A), the number of time warps is given by NTRY = 1 + (A — 2e — 1)/ 
5, the best choice for 6 is 6 = 2e + 1. This minimizes the number of 
time warps which need to be performed. For the problem of word 
spotting the obvious choice for A is A = M, 1.e., the entire length of the 
test pattern. For this case optimal values of e« and NTRY must still be 
determined. In general, the selection of ¢€ and NT'RY depends on 
several factors. As € is increased, the chance of a missed keyword 
decreases because more paths are examined, but the chance of a false 
alarm increases. Also, as € increases, the value of NTRY decreases 
[NTRY = A/(2e + 1) for 6 = 2e + 1], thereby reducing the amount of 
computation required. Thus, misses, false alarms, and the amount of 
computation must be traded-off in the selection of « and NTRY for a 
word spotting application. 

In a connected word recognition application, however, we not only 
must choose « and NTRY but must also choose A. We have shown 
that for A < 17 frames, it is possible to do connected digit recognition 
using only a single local-minimum time warp per word. However, we 
also found that the best recognition accuracy was achieved with A = 
21 but not with a single local-minimum time warp. Thus, there is an 
apparent trade-off between recognition accuracy and speed of com- 
putation. However, work by Rabiner and Schmidt® has shown that it 
is better not to center the beginning region of one word around the 
end of the previous word, as we did, but, rather, to center the beginning 
regions of one word several frames earlier than the ending region of 
the previous word. The reason for this is that the isolated reference 
patterns tend to be longer than the spoken connected words, and thus, 
the time warps tend to overestimate the ending frame of each word. 
We tried a simple experiment in which the beginning region of one 
word was centered eight frames earlier in the test pattern than the 
end of the previous word. The values of «, NTRY, and A were e = 8, 
NTRY = 1, and A = 17. Using these values and the same test 
utterances used by Rabiner and Schmidt,’ i.e., 80 sequences of from 2 
to 5 digits each spoken once by each of six talkers, we achieved a string 
recognition rate of 429 correct strings out of 480 possible. This may be 
compared with a total of 442 correct strings using « = 8, 6 = 3, and 
NTRY = 4, as reported by Rabiner and Schmidt. It should be noted, 
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however, that the system of Rabiner and Schmidt used multiple 
candidate strings while our simple experiment did not. When we reran 
the system of Rabiner and Schmidt using only a single candidate string 
(« = 8,6 = 3, NTRY = 4) we found only 430 correct strings out of the 
480 possible. Thus, with a single local-minimum time warp per word 
we achieved results comparable to those achieved by the use of four 
local-minimum time warps per word. 


iV. CONCLUSIONS 


We have shown that dynamic time warping algorithms can be 
efficiently applied to both word spotting and connected word recogni- 
tion. We have demonstrated the relative performance superiority of 
the local minimum DTW algorithm over the fixed-range DTW algorithm. 
It was also shown that the beginning regions of successive applications 
of the local minimum DTW algorithm need not overlap to achieve 
accuracy comparable to overlapping beginning regions. We have found 
that, for small beginning regions (small A), a single local-minimum 
time warp [with e« = (A — 1)/2, NTRY = 1] was as accurate as (and 
more computationally efficient than) any combination of the param- 
eters «, 6, and NTRY. Finally, we found that an extremely simple 
connected digit recognition system, i.e., a single local-minimum time 
warp per word using only one candidate string, achieved a string 
recognition rate of nearly 90 percent. 
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The Material Dispersion Zero in Infrared 
Optical Waveguide Materials 
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The wavelength where the material dispersion is zero, i.e., that 
wavelength for which a multimode lightguide would function at 
highest bandwidth, can be estimated using fundamental materials 
properties. Operation of lightguides at or near this wavelength is 
essential for the use of very long fiber lengths with high bandwidth. 
Calculations have been performed for large groups of metal oxides, 
fluorides, chlorides, and bromides. These data can be used to locate 
materials with potential for ultralow-loss fibers at longer wavelengths 
than are currently in use. 


l. INTRODUCTION 


Operation of optical waveguides at wavelengths up to 1.3 wm and 1.6 
pm is being actively investigated and losses of a few tenths dB/km 
appear to be feasible. Operation at a longer wavelength has the 
attraction that much lower losses should be possible since the \~* 
relationship controls the scattering loss, subject of course to the 
achievement of adequately low intrinsic and impurity absorptions. 
This should permit large distances between repeaters. 

The utilization of very long lengths of low-loss fibers is critically 
dependent on the material dispersion parameter (A/c) (d?n/dA°*), where 
n is the refractive index at wavelength A and c is the velocity of light. 
At the wavelength A» where the material dispersion is zero, the delay 
distortion in multimode fibers is minimized and maximum bandwidth 
can be achieved. In single-mode fibers, zero total dispersion requires 
balancing the waveguide dispersion with the material dispersion; the 
maximum bandwidth then occurs at a wavelength longer than Ao. 

Several reports have discussed possible longer wavelength fiber 
materials, including the extended discussion of Goodman’ for the 4-um 
band, the polycrystalline materials Tl(Br,I) of Pinnow et al.’ and 
Ag(Cl,Br) of Garfunkel et al.,> and ZnCl glass of Van Uitert and 
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Wemple.’ The use of fluoride glasses based on ZrFy, HfFy, and AIF3,”® 
mainly derived from earlier work by Poulain and co-workers (e.g., 
Refs. 7 and 8) has also been studied. A limited consideration of a few! 
or only a single’ value of the material dispersion was given in some of 
these reports, while others**”* ignored this parameter. 

Wemple’ has given a general technique for calculating an approxi- 
mate value of the wavelength of the material dispersion zero, based on 
materials parameters which were discussed in detail in his previous 
report.’° Using this approach it is possible to calculate Ao values for 
large groups of potential optical waveguide materials and thus locate 
the most promising candidates for operation at desired wavelengths. 
In this study we focus our attention on metal oxides, fluorides, chlo- 
rides, and bromides which may have such a potential and from which 
glasses might be made. The reasons for concentrating on these four 
groupings and on glasses specifically, as well as further limiting factors, 
are discussed in the appendix. 

These data can, of course, be used to direct the investigation of 
specific materials with potential for ultralow loss at longer wavelengths 
than are currently in use. They can also eliminate from consideration 
some already studied materials which may be transparent in this 
region, but which have too low a Ao value. 


ll. THE CALCULATION OF Ao 


Based on considerations of optical oscillator strengths and excitation 
energies, Wemple””” has developed a formalism from which Xo, the 
wavelength for which d’n/d)* = 0 is given as 


Cae - us 
Xo = ’ 


"\areZE Se 


where F£ is the average electronic (Sellmeier) excitation gap (usually a 
few eV higher than the band gap), f is the normalized oscillator 
strength, Z is the formal valence of the anion A, n is the number of 
valence electrons on the anion in the compound, d is the anion-cation 
distance, is the reduced mass of the anion-cation pair, A is Planck’s 
constant, c is the velocity of light, and e is the charge on the electron. 

Most often n = 8 for closed-shell anions, and eq. (1), rewritten in 
practical units, then reduces to 


d? 1/4 


where Ao is in pm, d is in A, and EF and f are in eV. Here Z = 1 for 
halides, 2 for oxides and chalcogenides, and 3 for pnictides; the reduced 
mass for a compound written as AB, (i.e., with one cation A and with 
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6 not necessarily an integer) of atomic weights M, and Mz is given by 
M,sMz 


-  bMe 8) 


For compound glasses, eq. (4) can be used to determine values for 
use in eqs. (2) and (3) for a mixture of i components, containing x mole 
fraction each of PR, where P is a cation, R is an anion, and r is the 
number of anions per cation (not necessarily integral) with )); x; = 1: 


f= > xfPR,s 
k= YxEpr.,, 
Za = YxZpr, 
Ma = yxMp, 
b = xr, 
Mz = . yxrMr, 
1 
d= ra Yxrdpr. (4) 


In these equations intensive properties, such as energies, are aver- 
aged according to the composition, but extensive properties, such as 
the bond lengths and the effective cation masses, are additionally 
weighted by the number of anion bonds per cation; b is the composi- 
tion-weighted average of this last parameter. 

Values of the required parameter were taken from Wemple’s collec- 
tion of data’® when available or from the literature. Otherwise esti- 
mates for EF and f (to the nearest 4% eV and so listed in Tables I and II) 
were made following Wemple’s principles, allowing for coordination 
number, bond length anomalies, ionicity, anion contact, and shallow 
electron cores.’° This last factor is significant for the d”° core of the Ag 
halides where N is effectively 14 rather than 8, introducing a correction 
factor for Ao in eq. (2) of (14/8)'/* = 1.15; smaller corrections apply 
elsewhere, for example to the s” cores of the Tl and Pb halides. 

Wemple has also pointed out that although amorphous nonvitreous 
solids differ significantly from crystals in the applicability of the 
approach used, glasses do not; they can be viewed as loosely-packed 
versions of the crystals to a good approximation.” 


lil. VALUES OF Aj FOR OXIDES, FLUORIDES, CHLORIDES, AND 
BROMIDES 


The £ and f values used and the Ao wavelengths obtained are listed 
for oxides in Table I and for fluorides, chlorides, and bromides in Table 
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Oxide 
Mo Oz 
W 


PoP; 
As2Os 
Nb:0; 
Ta20s 
$102 
GeO, 
SnO, 
TeOQ, 
TiO. 
ZrO2 
HfO, 
ThO, 
B.2Q3 
Al,O3 
Ga203 
In2O03 
As2O3 
Sb20; 
BizO; 


Lix 


NaX 
KX 
RbX 
CsX 
AgxX* 
TLX* 
BeX: 
MgXe 
CaX2 
SrXe 
BaXo2 
ZnX2* 
CdX2 
Hg Xo 
SnX>_ x 
PbX2* 
ScX3 
YXs3 
LaxX; 
LuX3 
AlX; 
GaX3 
InX3 
TIX; 
BiX;* 
TiX, 
ZrX4 
HfX, 
Thx, 
SnX, 
PbX, 


Table |—Parameters for the calculation of A, for oxides 
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Table II—Parameters for the calculation of A, for halides 
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Fig. 1—Material dispersion crossover wavelengths for oxides. Also shown are the »,, 
vz, and v3; fundamental water absorptions; “water” in fused silica shows v3 as well as the 
indicated absorptions at lower wavelengths. 


II; the more readily accessible values of d and p are omitted. The 
results are graphically presented in Figs. 1 and 2. A definite precision 
cannot be given for Ao; we feel that most values are accurate to +10 
percent, although in a few instances the uncertainty may well be a 
little bigger. Note that since the fourth root is involved in eq. (2), the 
results are only moderately sensitive to d and —& and even less so to f 
and p. 

Most oxides with a value of & less than 44 eV were excluded, since 
such substances are deeply colored and would also be expected to 
show significant intrinsic absorption in the wavelength region of inter- 
est. An exception was made with T1,.0 (black, FE ~ 4 eV), which is 
often an ingredient in glasses, in small amounts, since it reduces the 
melt viscosity and assists in the removal of bubbles. Oxides were also 
generally excluded, which would be expected to decompose under 
conditions required for glass melting. The oxides which are strong 
glass formers are marked, as are those which are weak or conditional 
glass formers, i.e., those which do not form glass by themselves but 
will do so in combination with other similar compounds." 

The halides of Table II and Fig. 2 were selected in an analogous 
way. Here BeF2 and ZnCl. are the only strong glass formers; the 
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occurrence of glasses based on BeF»2 and other fluorides has been 
summarized by Sun.’” Glasses based on the weakly glass forming ZrF,, 
HfF,, and AlF3 have been studied by Poulain and others.’® 

Figure 1 also includes the fundamental vibrations of the free H2O 
molecule,’? marked », v2, and v3, as well as the positions of the 
dominant “water,” i.e., OH, absorptions in fused silica, consisting of 
vy; and its overtones, and combinations with SiO, vibrations.’* These 
absorptions tend to become more intense and broader with increasing 
wavelength. In view of the difficulty of eliminating all traces of water, 
operation in windows between these absorptions may be desirable. For 
fused silica these windows occur at about 1.3, 1.6, and 2.5 um; since 
SiO, vibrations are involved, shifts in both position and intensity must 
be expected in moving to different systems, although differences may 
not be large with related oxides, such as GeO. With halides, however, 
significant difference may exist. 

Another factor sometimes considered is the location of the windows 
in the atmosphere occurring at about 1.3, 1.6, 2.3, 3.5, to 4, and 8 to 13 
um.” It was primarily this consideration that caused Goodman’ to 
limit his discussion to materials for the 4-um band. This does not seem 
to be a meaningful limitation, however, since the path of optical 
waveguide communication systems does not normally include any 
atmospheric links. 


* STRONG GLASS FORMERS 
e WEAK GLASS FORMERS 


TRI AND TETRAHALIDES 


| | Sr] Ba|Zn™ ec psn l 
Be Mg Ca Sr Zn d 'Sn Pb 


Ba 


EF luisa Ik Irp Agi cs! Ire 


MONOHALIDES Li 


_ |b [Na jK [Re | Cs [Ag | Tk 
Li Na K Rb 'Cs Ag 





Ag INMICROMETERS 


Fig. 2—Material dispersion crossover wavelengths for fluorides, chlorides, and bro- 
mides. 


332 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1981 


IV. DISCUSSION 


In the case of the oxides of Table I and Fig. 1, Ao extends only to 
about 5 um, with very limited choice above 3 um. The use of chalco- 
genides, such as ZnTe, and pnictides, such as BN and GaAs, also 
included for reference in Table I, would extend Ao to much longer 
wavelengths for heavier anions, but the reduced bandgap of such 
compounds would also raise the intrinsic absorption (also see the 
appendix). 

The only strong glass formers among halides are BeF, and ZnCl:," 
the former being highly toxic and the latter hygroscopic. Figure 2 
shows a wide range of Ao among fluorides, chlorides, and bromides with 
values up to 8 um. The effect of a change in the halogen can be seen 
in the sequences of fluoride, chloride, bromide, and iodide for Li (Ao 
= 1.3, 2.3, 2.7, and 3.4 wm) and for divalent Pb (Ao = 3.3, 5.1, 7.4, and 
13.9 xm). 

Using eqs. (4) to calculate Ao for mixtures should be an improvement 
over assuming linearity between values of Ao for the end members in 
binary mixtures, and can also be used for more complex compositions. 
Some sample calculations, e.g., for LITICl:, SiGeQ.,, MgeOClo, etc., 
show that the two approaches range from agreement to producing a 
15% difference, in either direction. 

Keeping in mind the limitations discussed in the appendix, Figs. 1 
and 2 indicate a number of compositions worthy of investigation. Note 
that Goodman’ limited his thinking to wavelengths of 34 to 4 um 
because this is an atmospheric transmission window and it might 
represent the limit of room temperature sources and detectors. The 
first constraint is not relevant to a normal fiber system containing no 
atmospheric linkages and the latter could also be unimportant if, say, 
a sufficiently low loss permitted a trans-Atlantic link without repeaters: 
Operation of the two cooled terminals would seem to be a reasonable 
price for such a system. In the oxides of Fig. 1, the currently used 
compositions based on SiOz and involving relatively small amounts of 
other oxides such as GeQ:2 provide a Ao near 1.3 pm, conveniently 
centered in a “water” window of SiOz. Use of the next window at about 
1.6 wm could be possible in SiOz with a larger addition of GeOs, with 
smaller amounts of many other possible oxides, or in a single mode 
fiber at zero total dispersion by balancing the waveguide and materials 
dispersions.’° The limiting scattering loss for pure GeO» at its Ao of 
about 1.7 ym has been estimated at 0.15 dB/km."’ Note that widely 
scattered Xo values for the SiQ2-GeOz system have been reported, 
apparently none above 16 percent GeO», however.”® 

Shifting our focus to wavelengths above 2 wm necessitates omitting 
the light Si atom. One possibility now centers about compositions 
containing the strong glass former GeO2; combinations of this with 


ZERO MATERIAL DISPERSION 333 


many of the weak or conditional glass formers in Fig. 1, such as PbO,?® 
Bi2.O3,”” and PbO plus Inz03,”" are known to give glasses. The other 
possibility is to use mixtures containing only the weak or conditional 
glass formers by themselves. The scope for a Ao above 3 ym appears to 
be quite restricted, with the availability of only BizO3, PbO, and the 
very toxic Tl,O, all with relatively small values of E. 

The highest value of Ao among the fluorides of Table II and Fig. 2 is 
the 3.5 um of TIF. The glass-forming fluorides have relatively low Xo 
values, varying from the 1.2 um of BeF2 and AIF; to the 1.7 to 1.8 um 
of ZrF, and HfF,. Fluoride glasses based on some of these compounds 
have recently been investigated for waveguide use in the infrared. 
Calculations, however show relatively low Ao values of 1.91, 1.81, and 
1.79 um for three typical compositions: 57.5 HfF,, 33.75 BaF», 8.75 
LaF.”; 40 BaF, 60 ZrF4, and 10 GdFs, 60 ZrF., 30 BaF»,° respectively. 
Thus Ao values of fluoride glasses will probably not be significantly 
larger than the 1.8 um of GeOs with its much lower reactivity and 
simpler chemistry. In the absence of the discovery of new glass-forming 
fluoride systems, prospects for fluoride glasses with Ao in the region 
beyond 2 um are therefore not very promising. 

In the case of chlorides, bromides, and iodides there is only one 
known glass former, ZnCl. Some ZnCl, fiber has been prepared,’ but 
because of its hygroscopicity, only very limited studies have been 
performed so far. Weak glass formers leading to mixed halide glasses 
may exist here, as yet undiscovered. The existence of glass formation 
in mixed oxyhalides based on a combination of Figs. 1 and 2 is another 
possibility by analogy with lead oxyfluoride glass.” 


V. MATERIALS PROPERTIES 


Given the above conditions, a potentially useful glass for a low-loss 
infrared optical waveguide would need to be readily melted and 
worked, stable to devitrification and phase separation, capable of being 
adequately purified at a not unreasonable cost, and have the necessary 
mechanical and chemical properties, particularly adequate strength 
and low hygroscopicity. The ability to modify the refractive index 
without large changes in the thermal expansion is also needed for 
index profile adjustment to obtain minimum mode dispersion in mul- 
timode fibers. 

In view of these considerations, Tables I and IT have been generally 
limited to those oxides, fluorides, chlorides, and bromides with EF > 44 
which can be melted without excessive decomposition. Fluorides seem 
to have Ao values that are too low and tend to be highly reactive and 
iodides are too unstable to be serious contenders, except possibly in 
small amounts for refractive index adjustment. A large hygroscopicity, 
as in ZnCle, can be a barrier, but such a substance may still be a useful 
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component in a mixed composition. The high toxicity of some ele- 
ments, such as Tl, As, and some fluorides, are strong minuses. Chal- 
cogenides and pnictides tend to be soft, brittle, somewhat reactive 
during preparation, and have low FE and bandgap values. 


VI. SUMMARY 


We have calculated the wavelength for the material dispersion zero, 
1e., that wavelength for which a multimode lightguide would function 
at highest bandwidth, for oxides and halides using fundamental prop- 
erties. The results, as presented in the tables and figures, can be used 
to direct the search for long-wave lightguides, subject to a number of 
materials considerations. 
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APPENDIX 
Material Constraints on Ultralow-Loss Infrared Optical Waveguides 
Al. Types of fibers 


Many types of optical waveguide fibers have been proposed, includ- 
ing liquid filled ones.” Glass has demonstrated its utility, but the use 
of single crystals and even of polycrystalline materials and plastically 
deformable Ag and T! halides has also been proposed. In view of the 
basic stepped nature of single crystal surfaces and of the inherent 
scattering at grain and subgrain boundaries in the other nonvitreous 
materials, the applicability of material other than glass to ultralow- 
loss fibers remains highly questionable. 


A2. Bulk losses 


Four fundamental types of losses are important”: 

(t) the electronic transition ultraviolet absorption tail loss (Urbach 
behavior, main optical absorption edge); 

(wz) the scattering loss, derived from both intrinsic and extrinsic 
inhomogeneities, dominant on the uv side caused by the A~* Rayleigh 
variation (Raman and Brillouin effects are not important”); 

(uit) the impurity losses, predominantly from “water” and 3d tran- 
sition metal impurities, although rare earth metals may be more 
important in the infrared’; and 

(tv) the vibrational states IR absorption tail losses (reststrahlen, 
highest energy optical phonon). 

Both the uv and the IR loss regions show an exponential drop-off 
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with energy, thus creating a “window,” with minimum loss at the 
crossover point (in the absence of impurity losses). Values as low as 
10-° dB/km are indicated’ by extrapolations (which are not necessarily 
valid). Ideally, this minimum-loss wavelength would also be close to 
the location of Ao for an ultralow-loss long-distance fiber link. 

A number of general principles for minimizing the various losses can 
be listed. For minimum vv tail and scattering losses, a large bandgap 
in a material with a low glass-transition temperature, a one or few 
component system, low isothermal compressibility, an open structure 
with a low density of anions and a low formal valence on the anion, 
and low coordination with a short nearest-neighbor bond length might 
be best.” For a minimum IR tail, a large bandgap, heavy atoms with 
weak (covalent) bonding, and low melting and glass-transition temper- 
atures would lead to lower frequency vibrational states; most important 
is a very low anharmonicity so as to minimize overtones. Low impurity 
levels and a good homogeneity are obvious. Inevitably some of these 
preferences are mutually incompatible, particularly the high bandgap 
favoring both low uv and IR tails, as against a low FE (a function of the 
bandgap) lowering Ao in eqs. (1) and (2). 
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This paper reports on results that complement those in an earlier 
paper by this writer which gives a constructive proof of the existence 
of an algorithm that, for each right-hand side a, produces a sequence 
which converges globally and superlinearly to a solution x of f (x) = 
a whenever f is a C'-diffeomorphism (i.e., is a continuously-differen- 
tiable invertible map with continuously-differentiable inverse) of a 
Banach space B onto itself and either B = R” or f satisfies certain 
other conditions that are often met in applications. Here we consider 
the case in which f’ is Lipschitz on each bounded subset of B. We 
give results which, while along the lines of those obtained earlier, 
concern a fundamentally different Newton-direction algorithm which 
does not appear to have been introduced previously, and which has 
the advantage that its implementation does not require the use of 
certain search procedures. 


l. INTRODUCTION 


Let f be a function from U into B, where B is a Banach space with 
norm |-|, and U is a nonempty open subset of B. We say that f is 
differentiable on a set S C U if f has a Frechet derivative f’(s) at each 
point s of S.+ (If, for example, B = R” with the usual Euclidean norm, 
then fis differentiable on U if it is continuously differentiable on U in 
the usual sense.) By f a C’-diffeomorphism, we mean that f is a 
homeomorphism of U onto B, and f’ and (f~’)’ exist and are continuous 
on U and B, respectively. (We emphasize that here continuity refers to 
the dependence of the derivatives on the points at which they are 


*This paper was presented at the Fourteenth Asilomar Conference on Circuits, 
Systems, and Computers (Pacific Grove, California, November 17-19, 1980). 

+ In other words, f is differentiable on S C U if for each s € S, there is a bounded 
linear map f’(s):B — B such that f(s + h) = f(s) + f’'(s)h + o((A|) as|h|— 0. 
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evaluated, not to their boundedness as operators, which is assured by 
definition*) C'-diffeomorphisms frequently arise in applications. 

The purpose of this paper is to report on results that complement 
those in Ref. 1 where a constructive proof is given of the existence of 
a Newton-direction algorithm that, for each a € B, generates a se- 
quence in U which converges globally and superlinearly to a solution 
x of f(x) = a whenever fis a C'-diffeomorphism of U onto B and either 
B = R” or f satisfies certain other conditions that are frequently met: 
in applications. (For the case of an important class of monotone 
diffeomorphisms fin a Hilbert space H, the “other conditions” reduce 
to simply the requirement that f’ be uniformly continuous on closed 
bounded subsets of H. A specific example in which H is infinite 
dimensional is given in Ref. 1.) 

The algorithm described in Ref. 1 typically involves the recursive 
determination of positive scalars yo, yi, --: (which determine the 
successive steplengths) such that a certain ratio R; (y,) (which depends 
on the kth iterate x”) lies between prescribed bounds for all k = 0, 1, 
2, ---. While it is proved that the y, can be chosen as required, and 
that y, = 1 for all sufficiently large k, the actual determination of the 
vy: in a specific case would ordinarily require the use of a one-dimen- 
sional search procedure for a finite (and possibly large) number of 
values of k. 

In this paper we address the case in which U = B and f’ is Lipschitz 
on bounded subsets of B (i.e., is such that for each bounded subset S 
of B there is a constant A such that | f’(u) — f’(v) | Ss A|u — v| for all 
u and vu in S). We give results which, while along the lines of those in 
Ref. 1, concern a fundamentally different Newton-direction algorithm 
that does not appear to have been introduced earlier, and which does 
not require the use of search procedures to solve subproblems of the 
type outlined above. 

Our results are presented in Section II. As a consequence of the 
Lipschitz hypothesis, proofs are comparatively simple and we are able 
to establish quadratic (rather than superlinear) convergence. (Recall 
that a sequence x’, x”, --- in B converges quadratically to an element 
x of B if the sequence converges to x and there is a constant c such 
that |x**! — x|<c|x* — x|’ for all &.f) 

General relationships between diffeomorphisms and computation of 
the type described in Ref. 1 and in Section II do not appear to have 


* And of course, this continuity is with respect to the usual induced norm of a 
bounded linear map of B into B. 
+ Quadratic convergence results follow easily from those in Ref. 1 under the Lipschitz 
eat used here. (In this connection, see the last part of the proof of Lemma 1 in 
ection II. 
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been reported on earlier by other writers. On the other hand, as in 
Ref. 1 our approach involves the minimization of a functional, and 
therefore in a general sense there is a vast related literature. [See, for 
example, Ref. 2 and note (p. 190) that the least-squares Newton- 
direction methods described there require, in particular, the existence 
of second derivatives of f (our notation).] Additional background 
material can be found in Ref. 1. 


Il. PROCESSES No and N, 


Throughout this section we use the terms Lipschitz and converges 
quadratically in the way indicated in Section I, we denote the usual 
induced norm of a linear map A of B into B by | A|, and we take U = 
B. 

With / differentiable on B, but not necessarily a C'-diffeomorphism, 
and with x° and a any two elements of B, consider the following 
process, in which s; denotes | f(x”) — a| whenever x* € B is defined. 


Process N,: Choose p € [%4, 1) and A > 0. Do the following for k = 
Oe Were 


If f(x”) = a, set x*t! = x’. 


If f(x*) # a, determine ¢, € B such that 


f'(x")b, = a — f(x”). Then 
1. Let ye = (Ase) ifs, > 2pA7! 


= | if Sr= 2pA~’. 

2. Let y**! = x* + yadp. 

3. Set x**? = y**! if either s, > 207! and |f(y**!) -— als [1 - 
(2s) ~"]sn, or Sp S 2pA7' and | f(y**") — a| S %As3. If neither pair of 
conditions is met, replace A by 2A in Step 1 and the sentence preceding 
this sentence, and return to Step 1. 


Our main result is the following. 

Theorem 1: Suppose that f is a C'-diffeomorphism of B onto B. Let 
f’ be Lipschitz on bounded subsets of B, and let | (f~')’| be bounded 
on bounded subsets of B. Then for each a and each x°, Process N, 
can be carried out, and x', x”, --+ converges quadratically to the 
unique solution x of f(x) = a. 


2.1 Proof of Theorem 1 


Let a and x” be given. 
We first prove two lemmas which concern cases in which f need not 
be a C’-diffeomorphism. Let L = {v € B:|f(v) — a| = |f(x°) — aj}, 
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and let L denote {w + a f’(w)”'[a — f(w)]:w € L, a € [0, 1]} when 
f’(-)7! exists on L. (Assuming that L is defined, notice that it is 
bounded if L is bounded and | f’(-)~’| is bounded on L. This observa- 
tion is used later in the proof of Theorem 1 and in connection with 
Lemmas 1 and 2, below.) With 7 a positive constant, consider the 
following process. 


Process No: Choose p € [%, 1). Do the following for k = 0, 1, ---. 
If f(x") = a, set x**) = x*. 
If f(x”) # a, determine ¢; € B such that 
f'(x")bp = a — f(x"). Then let 
vr = (nsx) if sp >2pn 
=] if sp < 2p. 
Set x**! = x* + yids. 


Lemma 1: Assume that L is bounded and that f'(-) and f’(-)~' exist 
on L with | f'(w)"*| = K for w € L and some constant K. Assume also 
that f’(-) exists and is Lipschitz, with Lipschitz constant A, on L. 
Then for yn = AR’, 

(a) Process No can be carried out, and for each k such that s;, ~ 0 
we have 


Sra = [1 —- (2nsz)~"]sp if s,> 2en , (1) 
Spa = 2 nSi S pSp if s.= 2on'. (2) 
(6) s, > 0 ask— 00, 
(c) If there is an x € B such that f(x) = a and x* > x ask > ~, 
then {x} converges quadratically. 
2.1.1 Proof of Lemma 1 


We will use the following proposition. 

Proposition 1: Suppose that the hypotheses of Lemma 1 are met. If 
x* € L, and y € [0, 1], and ¢, denotes f’(x*)~"[a — f (x*)], then, for 7 
> AK®, we have | f(x* + yox) — a| S (1 — y) | f(x*) — a| + Yny’ |f (x*) 
— a)’. 

Proof: We have 

| f(x" + yor) — a] =| f(x") — a + f'(x")ybe + 8{ 
in which 


5 = f (x* + you) — f(x") — f'(x*) yor. 
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Thus 
| f(x* + yn) — a] S (1 — y) [fF (x") — a] +] 8], 


and 


b= | [f'(x" + Bydi) — f'(x")] dB-ydx. 


Since | 6| < *4Ay’K" | f(x”) — a]’, we have proved the proposition.* 

Assume now that 7 = A K?, and that the hypotheses of Lemma 1 are 
met. 

Let k be such that either k = 0, or Process No can be used to generate 
xi, .--, x* with x’ © L for j = 1, 2, ---, k. Suppose that s, # 0. Since 
x* € L, ¢, can be determined. Since p € [%, 1), when s, > 2pn7' we 
have (ys;)"' < 1. Thus, by the proposition, (1) holds. On the other 
hand, obviously 2ns; < p when s; = 2pn~*, and thus, by the proposition, 
(2) is met. This shows that x“*! can be determined, that it satisfies (1) 
and (2), and that x“*’ € L, which proves Part (a). 

Part (6) is a direct consequence of Part (a), because, by Part (a), if 
s, does not approach zero as k > © we must have s; > 2p’ for all k 
in which case [1 — (2ns0)~*] € (0, 1) and s; = [1 — (2ns0)~*]“so for k = 
1, which is a contradiction. 

Assume now that B contains an x such that f(x) = a and x* > x as 
k — ~,+ Since x* € L for all k, and L is closed, x € L. Let J denote 
f’(x). Since J is an invertible bounded linear map of B into itself, there 
are positive constants £; and 2 such that fi |u| =|Ju|s Bo|u| for u 
€ B. For each k, we have 


|f(x*) — a| =|f(x) -at+ J (x* — x) + 8, | 
in which | 6; | (J|x* — x])"' > Oask—> ~. 
Notice that for some m, 
| J(x* — x)| = 2| 6, | for k=m. 

Thus for k = m, 

| f(x*) — al =| J(x* -— x) |-| 6.) = %| J (x* — x) | = 4B | x* — x], 
and, on the other hand, 

|f(x*) —a| S| J(x* — x)| + | | = %| JT (x* — x) | Ss % Bol x* — x]. 


We have sp; = ns: for k = M for some M = m. 


* With regard to the origin of the formula for y, in Process No, notice that the right 
side of the main inequality of the proposition is minimized with respect to y at y = 
(ns). | 

+ The existence of such an x follows from our hypotheses, but this fact is not needed 
for our purposes. 
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Therefore, 
ja**?-— x] =< % nB7' B3|x*-x), REM, 


which completes the proof of the lemma.*'t 


Lemma 2: Suppose that L is bounded, and that f’(-) and f’(-)7' 
exist on L with | f’(-)~'| bounded on L. Suppose also that f'(-) exists 
and is Lipschitz on L. Then Process N, can be carried out, we have 
Ss, > 0 as k > ~, and if there is an x € B such that f(x) = a and x" 
—>xask—o, then x', x’, --- converges quadratically. 


2.1.2 Proof of Lemma 2 


Consider Process N;. By Lemma 1, there is a constant Ao that 
depends only on f, a, and x° such that if A in Step 1 and the first 
sentence of Step 3 satisfies A = Ao, and if either k = 0 and so ¥ 0, ork 
> 0 and Process N, can be used to determine x” with s, ~ 0 and s,; < 
So, then Step 3 can be carried out on the first pass. Notice that 
whenever x"! is set equal to y“*! in Step 3, we have Spi: < Sp. 

Since for any A > 0 there is a nonnegative integer p such that 27 
= Xo, it follows that Process N, can be carried out, and that for some 
nonnegative integers g and r, we have s**' < o,s* for k = q, where op 
= [1 — (2"*'\s,)~'] when s* > 2p(2"A)~' and o; = p otherwise. Since 
o, <1 for k=q, it is clear that s,; > 0 as k > o, and therefore that s;4; 
< \% 2’\si. for k = M for some M. Thus, by the proof of Part (c) of 
Lemma 1, our proof of Lemma 2 is complete. 

Now let the hypotheses of Theorem 1 be met. The proof of Theorem 
3 of Ref. 1 shows that L is bounded, that f’(-)~' exists on B, and that 
| £’(-)* | is bounded on L. Since fis a homeomorphism of B onto B, s; 
— 0 as k > ~ implies that x” — x as k > », where x satisfies f(x) = 


a. By Lemma 2, this completes the proof of Theorem 1. 


2.2 Monotone diffeomorphisms in Hilbert space 


Let wW:[0, 0) — [0, ©) be continuous, strictly increasing, and such 
that ¥(0) = 0, V(a) — © asa— ©, and a ‘v(a) = c for a € (0, @) for 
some positive constants c and a. Notice that, for example, Y(a) = a 
meets these conditions. 


*'The fact that {x*} converges quadratically follows from a direct extension of a 
known result (see Ref. 3, p. 312) since either s, = 0 for some k, or there is an M such that 
ys = 1 for k = M. The short proof given above is included for the sake of completeness. 


+ D. J. Rose has informed this writer that in recent independent joint work with R. 
Bank,’ done subsequent to the appearance of preprints of Ref. 1, a corresponding result, 
as well as a result corresponding to Theorem 1, was obtained for a process in which y, 
= (1 + 7s,)"', where the 7, satisfy certain inequalities. They study a case in which an 
approximation M, to f’(x*) can be used in place of f’(x*). Also, earlier related work 
along different lines concerning uniformly monotone gradient maps /:R” — R” was done 
by Bank and Rose.” 
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Theorem 2: Let f map a real Hilbert space H, with inner product 
(-, +), into itself such that (f(u) — f(v), u-— v) = |u—vl|(|u— ov) 
for all u, v € H. Assume that f’ exists and ts Lipschitz on bounded 
subsets of H. Then f is a C'-diffeomorphism of H onto H, and the 
conclusion of Theorem 1 holds. 

Using Theorem 1, a proof of Theorem 2 can be obtained by trivially 
modifying the proof of Theorem 4 of Ref. 1. 


2.3 B= R" 


The following complete result is a direct corollary of Theorem 1 (see 
the proof of Theorem 5 of Ref 1). 

Theorem 3: Let B = R”, and let f’ be Lipschitz and continuously 
differentiable on bounded subsets of R". Then f is a C'-diffeomorph- 
ism of R” onto itself if and only if 

(i) Process N, can be carried out for each a and each x’. 

(tt) For each a, the sequence produced by Process N, converges 
quadratically to a solution x of f(x) = a, and x does not depend on x’. 


2.4 Comments 


As in Ref 1, our primary purpose is to focus attention on general 
relationships between diffeomorphisms and computation. Clearly, no 
attempt is made to optimize the performance of all aspects of the type 
of algorithm described. However, there are some basically self-evident 
modifications that are sometimes useful. For example, the total num- 
ber of iterations required in a specific case can sometimes be reduced 
significantly by repeatedly, or occasionally, stopping the algorithm 
after a number of steps and resetting the initial value of A in Process 
N, to a smaller number. (It is not difficult to give rules of thumb 
concerning when to stop the algorithm and by how much to reduce X, 
but we have not tried to prove theorems that bear on these matters.) 
Of course, bounds on the location of the solution and estimates of K 
and A, which are available in some problems, can be used in an obvious 
way. Similarly, if for example B = R”, a globally convergent steepest- 
descent process (see Ref. 6)* might be used initially to obtain a better 
approximation to the solution before the Newton-direction algorithm 
is used. (In fact, a well known and often useful strategy is to combine 
steepest descent and pure Newton iterations in this way.t) 


* We take this opportunity to correct a typographical error in Ref. 6. On page 1004, 
left column, line 2, [2] should be replaced with [21]. 


} This paragraph was motivated by a helpful observation by D. J. Rose to the effect 
that, as the algorithm stands, there are cases in which many iterations are required. 
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Program Development by Stepwise Refinement 
and Related Topics 
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(Manuscript received September 22, 1980) 


Computer program development by stepwise refinement has been 
advocated by many people. We take another look at stepwise refine- 
ment in light of recent developments in programming languages and 
programming methodology such as abstract data types, correctness 
proofs and formal specifications, parallel programs and multiversion 
programs. We offer suggestions for the refinement process and discuss 
program maintainability. 


I. INTRODUCTION 


The correct design.of nontrivial programs and systems of programs 
is an intellectually challenging and difficult task. Often programs are 
designed with very little time spent on the design itself, the effort 
being concentrated on coding. This could be due to management’s 
desire to see something working as soon as possible to be assured that 
work is progressing, or it could be due to the programmer’s desire to 
“attack the problem right away.” 

Not only is there no emphasis on design, the approach to it is also 
not systematic or disciplined. This results in programs that do not 
meet specifications in terms of correct output and performance require- 
ments. 

What we want is a programming methodology that puts some 
discipline and structure in the design process without stifling creativity. 
A programming methodology should: 

(1) Help us master the complexity of the problem being solved and 
give us some guidelines on how to formulate the problem solution. 

(tt) Provide us with a written record of the design process. The 
design can then be read by others, and the design decisions can be 
appreciated or constructively criticized. 

(iz) Result in programs that are understandable. 
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(tv) Lead to programs whose correctness can be verified by proofs. 
Since proofs are difficult, the methodology should allow for a sys- 
tematic approach to program testing. 

(v) Be generally applicable and not restricted to a class of problems. 

(ut) Allow for the production of efficient programs. 

(vit) Allow for the production of programs that can be modified 
systematically. 

In this tutorial we discuss a programming methodology called step- 
wise refinement and informally show that it satisfies these criteria. 


ll. STEPWISE REFINEMENT 


Stepwise refinement is a top-down design approach to program 
development (first advocated by Wirth*). Wirth really gave a sys- 
tematic formulation and description of what many programmers were 
previously doing intuitively. According to Brooks,” stepwise refinement 
1s the most important new programming formalization of the decade. 
Stepwise refinement is applicable not only to program design, but also 
to the design of complex systems. 

In a top-down approach, the problem to be solved is decomposed or 
refined into subproblems which are then solved. The decomposition or 
refinement should be such that:** 

(t) The subproblems should be solvable. 

(tt) A subproblem should be solvable with as little impact on the 
other subproblems as possible. 

(zit) The solution of each subproblem should involve less effort than 
the original problem. 

(iv) Once the subproblems are solved, the solution of the problem 
should not require much additional effort. 

This process is repeated on the subproblems; of course, if the solution 
of a problem is obvious or trivial, then this decomposition is not 
necessary. 

If Po is the initial problem formulation/solution, then the final 
problem formulation/solution P, (an executable program) is arrived 
at after a series of gradual “refinement” steps, 


Pp >P>Po=>.--=>P,. 


The refinement P;4; of P; is produced by supplying more details for 
the problem formulation/solution P;. The refinements Po, --+, Pn 
represent different levels of abstraction. Po may be said to give the 
most abstract view of the problem solution P,, while P, represents a 
detailed version of the solution for Po. 

As an example of abstraction levels, consider a program that auto- 
mates the record- keeping of an insurance company. At the highest 
level of abstraction, the program deals with the insurance company as 
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an entity. At succeeding lower levels of abstraction, the program deals 
with 


different insurance categories (auto, home, life, etc.) 
groups of policies in the above categories 

individual policies in the above groups 

details of individual policies 


oo 8 08 


Each refinement P; consists of a sequence of instructions and data 
descriptions P;,;, 
Pi 


° 
e 
Pin, 


In each refinement step, we provide more details on how each P; is to 
be implemented. The refinement process stops when we reach a stage 
(t) where all the instructions can be executed on a computer, or 

(tz) where instructions can be easily translated to computer execut- 
able instructions. 

Pictorially, the refinement process may be depicted as shown in 
Fig. 1. The final program is a collection of the nodes at the last 
refinement level P,,. 

The design can be probed to any desired level of detail 1 (0 =i <n). 
Understanding the design process is aided by the fact that level 1 
provides an overview of levels 1 + 1 through n. 

We illustrate the stepwise refinement process with annotated ex- 
amples. The notation we will use for conveying our ideas will be 
Pascal-like’ and include guarded commands.° PL/I will be used to show 
the executable versions of some programs. 


Py 





Fig. 1—The refinement process. 
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The guarded commands are 
(t) Selection 


if b — SL, 
| |b. ——> SL> 
[]b, — SL, 
fi 


The 6,’s are called the guards (Boolean expressions) and the SL; are 
statement lists. For a successful execution of the selection statement, 
at least one of the guards must be true. If only one guard is true, then 
the corresponding statement list is executed. If more than one guard 
is true, then one of the corresponding statement lists is selected 
nondeterministically (i.e., the user cannot tell beforehand) and exe- 
cuted, e.g., 


if a>b —> max:=a 
[]Jo=a —> max:=b 
fi 


If a = b, then both the guards are true and either of the statements 
max := a or max := b may be executed. Either way, the answer is 
right. This symmetry is aesthetically pleasing when compared to 
conventional deterministic programming. 

(it) Repetition 


do b, — SL, 
{ |b. —_ SL> 


[]b, — SL, 
od 


The loop is repeatedly executed as long as one of the guards is true. 
If one guard is true, then the corresponding statement list is executed. 
As in the selection statement, if more than one guard is true, then one 
of the corresponding lists is arbitrarily selected and executed. 

Implementation of these statements in C, Pascal, PL/I, etc., will be 
deterministic. For example, in PL/I: 

(1) Selection 


IF by 
THEN DO; SL,; END; 
ELSE IF bs THEN DO; SL2; END; 


ELSE IF 6, THEN DO; SL,; END; 
ELSE ERROR; 
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(zt) Repetition 


L:DO WHILE (‘1° B); 
IF b, 
THEN DO; SL,; END; 
ELSE IF b2 THEN DO; SL.; END; 


ELSE IF b, THEN DO; SL,; END; 
ELSE GOTO LE; 
END L; 
LE:; 


Note: These statements could be more conveniently implemented 
using the new PL/I SELECT and LEAVE statements. 


ll. EXAMPLES OF STEPWISE REFINEMENT 


The examples used to illustrate stepwise refinement are small out of 
necessity. The reader is encouraged to apply stepwise refinement to 
larger problems. 


Example 1 


Write a program to simulate a week in John’s life. 
Initial refinement Po: 
Simulate a week in John’s life 


If we were programming in a language that understood the above 
instruction, then we wouldn’t have to refine it further. 


Refinement P,: 


a. d := monday {next day to be simulated is d} 
b repeat 

C. simulate day d in John's life 

d. d := next day 

e until week over 


A refinement consists of programming language instructions mixed 
with English statements. 


Refinement P32: 
Line c of P; is refined as 


Sleep until alarm goes off 
Go through morning ritual 
Spend the day 

Go through evening ritual 
Prepare to sleep 


Line d is refined as 
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monday 
SUCC(d) 


if d=sunday —- dd: 
[ld~ sunday —- d: 
fi 


where the Pascal function SUCC gives the next day in the range of 
values monday, tuesday, --- , sunday. Line e: “week over’ is refined as 
“d = monday.” 

Collecting these refinements of P;’s instructions, we get refinement 
Pp». 


d := monday 
repeat 
Sleep until alarm goes off 
Go through morning ritual 
Spend the day 
Go through the evening ritual 
Prepare to sleep 
if d=sunday —- d:= monday 
[ld#sunday — d:= SUCC(d) 
fi 
until d = monday. 


This collection can be done mechanically and we shall in general omit 
it. 
“Spend the day” may be refined as 


if weekday — _ go to work 
work 
return home 
[]Jweekend -— _ read newspaper 
laze around 
read book 
watch TV 
fi 
Similarly, the other instructions of Pz, may be refined and the 
refinement process continued to the desired level of detail. In the 
refinement we have tried to model processes of the problem domain.’ 
An initial decomposition might not be feasible or nice, in which case 
we back up and try another decomposition. We shall only present the 
final set of decompositions. 


Example 2 


Write a program that reads in a list of positive numbers a), az, 
-++, @, (n = O) and prints the sums of all natural numbers up to each 
a;, 1.e., the sums: . 


aj ag an 
yg De 8, , 2a 
i=0 1=0 1=0 
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Initial refinement Po: Print )\%o 1, Yo 1, «++, Vo 7. 


PY: 
read a 
do while there 
exists data -—> Compute sum = ) i 
i=O 
Print sum 
read a 


od 


Because we are aiming for an executable program in a sequential 
programming language, the refinement P; reflects the decision to read 
in an input element, compute its sum, print the sum, and then read 
another input element. Alternately, had our target been a parallel 
computer we would have probably read in all the input elements, 
computed the sums in parallel, and then printed them out. Many 
implicit decisions underlie every refinement. 


P.2: @ while there exist data 
is refined to 
not EOF 
e Compute sum = 4.9 i 
is refined to 
i:=0O 
sum := 0 {sum =O+1+2+.-- +i} 
doi~¥a — /:=i+1;sum:= sum +/od 

Let us now examine the concept of a loop invariant. A loop invariant 
is an assertion about program variables; it statically captures the 
meaning of a loop thus helping us understand it. Loop invariants are 
true before and after the execution of a loop, and before and after each 
execution of the loop body. Dijkstra® suggests some ways of finding 
the loop invariant using the desired post-condition (state of variables 
after the loop terminates). The loop invariant can actually aid in 
determining the guards and the corresponding statement lists. 

Let I be the loop invariant sum = 0 +1+2+--- +1, J is true 
initially because z: = 0 and sum = 0. Evaluation of the guard 1 ¥ a does 
not affect J; the statement z := 1 + 1 destroys J, resulting in sum = 0 
+1+2+.--- +12-—1. But sum = sum + 1 restores the validity of 
invariant J. When the guard evaluates to false, i.e., 1 = a, the loop 
terminates. Now in addition to J being true we have i = a, implying 
the desired result sum =O0+1+2+.--- +a. 

How can we demonstrate loop termination? For this we must show 
the existence of a function, initially = 0, whose value is decreased by 


* The fact that we have two read statements in P; shows that our design is influenced 
by our target language (PL/I in this case). In PL/I, unlike in Pascal, a read must occur on 
an empty file before an end-of-file is indicated (via the variable EOF in our case). 
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one every time the loop is executed. When this function becomes < 0, 
we stop. Such a function is a — 1; executing 2 := 1 + 1 decreases its 
value by 1. When a — i = 0, we have 1 = a which is when the guard 
evaluates to false and the loop terminates. 

Continuing the refinements we get 


P3: 
read a 
donotEOF — {compute sum = » i 
i=0 
i:=0 
sum := 0 {sum =O0+1+2+.--- +/} 
doi#a —> j:=it+ 
sum := sum + / 
od 
print sum 
read a 
od 


P, (in PL/I): 


SUM: PROC OPTIONS(MAIN); 
DCL (A /*NEXT INPUT ELEMENT */ 
1 /*LOOP VARIABLE * / 
SUM /*SUM=0414+2+..--4+/ */ 
)FIXED DEC, 
EOF BIT(1) INITC'O'B); 
ON ENDFILE EOF=‘1'B; 


GET LIST(A); 

DO WHILE (~EOF); 
/=0; SUM=0; 
DO WHILE (/~=A); 

l=/+1; SUM=SUM++I!. END; 

PUT SKIP LIST (‘ SUM UPTO’, A, ‘IS’, SUM); 
GET LIST(A); 
END; 

END SUM; 

Example 3 
Write a program to determine the maximum element value of an m 


xX n array A (m,n = 1). 


Po: determine the max element value of A 
P,: i:=0 {last row examined} 
initialize max {max is the maximum element 
of rows 1 --- /—Iloop invariant /,} 
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do all rows — f:=i+1 
not examined max:=maximum(row /, max) 
od 
P.: @ initialize max is refined to 
max := A[1, 1] 
e all rows not examined 


is refined to 
im 
® max := maximum (row /, max) 
is refined as 
j =O {max = maximum of rows 1...i— 1 and elements 
1 .../ Of row i—loop invariant Io} 
do all elements of > ji=jt+ 
row / not examined max := MAX(max, Al[/, j ]) 
od 


I, is the loop invariant. As an exercise, the reader should try and show 
that the loops leave f; and J; invariant, 1.e., unchanged. 
P3: @ all elements of row j not examined 
is refined as 
J#n 
e max := MAX(max, A[z, 7 ]}) 
is refined as 
if max=Alfi,j/] — — skip 
[]Jmax = Afi, j] — max:=Afi, /] 
fi 


where skip denotes the null statement. 

The iterative feature is the most important feature of a programming 
language.” The do --- od construct allows us to express algorithms 
clearly and succinctly. The above example could have been done better 
had the author not used the do --- od construct to just simulate the 
while statement. Making fuller use of the do --- od construct, we get 
the following program for the above problem: 


P 3: i:=0O {number of rows examined so far} 
j:= 0 {number of elements of row i + 1 examined so far} 
initialize max {max is the maximum of all the elements in the first 


i rows and the first / elements of row/ +1} —/ 
do i<mrowsandj<n — j:=j+1 


elements of row/ + 1 max := MAX(max, Afi, /]) 
examined 

[|i < m rows and all — move to the next row 
elements of row/ + 1 
examined 


od 


STEPWISE REFINEMENT PROGRAM DEVELOPMENT 355 


P3: (:=0; j:=0 
max :=A[1,1] {max is the maximum of all the 
elements in the first / rows 
and the first / elements of rowi+1}—/ 
do i<xmandj<n —> j:=jt+1 
max := MAX(max, Ali, /]) 
[i<mandj=n > f:=i+1;/j/:=0 
od 
Gries also shows that the do --- od construct usually eliminates the 


need for loop exits necessary in programs that use the while state- 
ment.® 


Example 4 

The Touch-Tone® telephone provides an easy but limited means of 
communicating with a computer (see Fig. 2). The problem is to write 
a program that provides a simple adding machine to the user.” For 
example: 


e User offhook 1 # 52 # 4x # onhook 
Input: 
e System “one” “SIX “ten 
response: point point 
(audio) two” seven” 


The characters # and * represent + and .-, respectively. 

The following modules are available to the programmer: 

e SPEAK (string)—provides an audio response for the number 
represented by the string. 


ea ES) et 


ey EP GE] EI 
ee Ley feet i 


Fig. 2—The Touch-Tone® telephone’s pushbutton dial. 
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p21} th | 38] null | 


° ADD (string1, string2) — string] := string] + string2 
8 | - | 3{ null | string 1 
2g 4 | null | string 2 
pif 2]. | 3) null | string 1 


e waitsignal(char)—sets char to the next input character when 
available. 


The input/output specifications written more formally are: 


input: offhook fi # f2 # +++ # fr # onhook, 
integer one or more digits 


real —one or more digits followed by 
—one or more digits followed by * and at 
least one digit 
—one * followed by at least one digit 


where f; = 


(l<i<n) 


output: SPEAK(SUM;), SPEAK(SUM)), ---, SPEAK(SUM,,) 
where SUM; = Yi-i fr, 1S isn, 
and the audio response occurs after the character # is input. 
We assume that the maximum length of numbers input will be k — 
1. To focus on the refinement process, we make the following additional 
assumptions: 
(t) one addition session, 
(iz) no errors of any kind. 
In the second version of the solution we will eliminate these restric- 
tions. 
Refinement Po: Do telephone addition. 


Py: Compute and speak out the running sum of the numbers 
input. 
P»: plus := ‘#’; point := ‘*’ 


waitsignal(c) {c contains the next 
input char to be processed; 
offhook is the first one} 
waitsignal(c) {get char after offhook} 
initially SUM is O 
do c # ‘onhook’ — _ read number into A 
ADD(SUM, A) 
SPEAK(SUM) 
waitsignal(c) {+ consumed} 
od 
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The number variables SUM and A are implemented as strings because 
modules SPEAK and ADD expect strings as arguments. 

Each variable is represented by 

(t) variable of type string, 

(it) an integer variable that denotes the length of the string, 
where type string = array [1 --- k] of char. 
In addition to SPEAK and ADD we need 

(t) a procedure ZERO to initialize the numbers represented as strings 
to 0, 

(tz) a procedure APPEND to help build a number. 

They are defined as 


procedure ZERO(var X:string; /:integer); 
begin /:=1; X[/]:=null end 

procedure APPEND(var X:string; /:integer; c:char); 
begin X[/]:=c; /:=/ + 1; X[/]:=null end 


The implementation of numbers and their operations in terms of 
strings is an example of data refinement. 
e Read number into A is refined as 


ZEROA 
doc+#plus — ifc=point — APPEND‘.’toA 
[]}c # point — APPENDctoA 
fi 
waitsignal(c) 
od 


Collecting the refinements together we get 


const point = ‘+’; plus = ‘#'; 
type string = array[1 -- k] of char; 
var SUM, A:string; 
ISUM, /A:integer; {lengths of SUM, A} 
begin waitsignal (c); waitsignal (c); 
ZERO(SUM, /SUM); 
do c ¥ ‘onhook’ — {read number into A} 
ZERO(A, IA) 
do c ¥ plus 
ifc = point — APPEND(A, JA, ‘-’) 
[]c # point — APPEND(A, JA, c) 
fi 
waitsignal(c) 
od 
ADD(SUM, A); 
SPEAK(SUM); 
waitsignal(c) 
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od 
end 


Instead of including inline the refinement of “read number into A”’ 
in the final version of the program, it would have been more appropri- 
ate to make the refinement into a procedure READ and to call READ 
from the final version. This is because READ and the operations ADD 
and SPEAK (which appear in the final program) operate on numbers 
thus representing the same level of abstraction. Also, if an instruction 
appears more than once, then it should perhaps become a procedure 
call. The instruction would then be refined only once. 

In the following version of the above program we eliminate the 
restrictions of a single session and no errors. The following types of 
errors are considered possible: 


illegal characters, 

n = 2 decimal points per number, 
only a decimal point—no digits, 

+ follows +, offhook, i.e., null number, 
session starts with other than offhook. 


The initial problem formulation Pp is 
dotrue — Do telephone addition od 


P,:¢ Do telephone addition is refined as 
Compute and speak out the running sum of the numbers input 
so far. 

This is refined as 

P2: waitsignal(c) 


if c# ‘offhook’ — error 
[] c = ‘offhook’ —_ skip 
fi 
waitsignal(c) 
check for valid char {digits, plus, point, ‘onhook’} 
initially sum is O 
do c # ‘onhook’ — Read number into A 
ifc=plus —> ADD(SUM, A) 
SPEAK(SUM) 
waitsignal(c) 
check for valid char 
[] ¢ = ‘onhook’ — skip 
fi 
od 
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Using the procedures APPEND and ZERO defined before, we refine 
read number into A as 


ZERO A 

# digits, # pts := O; 

docisa | 
digit or a point 


— ifcisa point — Append '‘-’toA 
if # pts = O— # pts := ¥ pts + 1 
[] # pts  O — error 
fi 
[] cis adigit— AppendctoA 
# digits := # digits + 1 
fi 


waitsignal(c); 
check for valid char 
od 
if # digits = O — error 
[] # digits + 0 > skip 
fi 


Continuation of this refinement process is similar to the error-free 
version and we omit it. 


Example 5. McDonald's warehouse problem °® 


Given a list of item cards ordered by item number, produce the 
management report shown in Fig. 3. Each invoice has an item number, 
a code D for delivery, and R for received, and the quantity received or 
delivered. 

Po: Produce management report 

P,: a. Print heading 

b. Process the item groups 
c. Print number of item groups changed 


MANAGEMENT REPORT 


ITEM NET CHANGE 
P, 25 


# CHANGED = 20 





Fig. 3—From item cards to management report. 
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° Line b is refined as 


# changed := 0 


read item 
do there are more — process an item group 
item groups print item and net change 
#% Changed := # changed + 1 
od 


° there are more item groups 
is refined as 
not EOF 

© process an item group 
is refined as 


netchange := O 
itemgroup # := item# 
do item in group and not EOF 
— if code = R — netchange := netchange + Qty 
[] code = D— netchange := netchange — Qty 
fi 
read item 


od 
° item in group 
is refined as 


itemgroup # = item# 


This concludes the example. 


Example 6 


Using insertion sort, sort the array A (size n = 1) in nondecreasing 
order, 1.e., Ay = Ay = --- = An, and the new values of array A are a 
permutation of its old values. 

Po: Sort the array A 

Pictorially we can characterize the input and output specifications of 
the array A as 

(z) initially 


1 n 
/ KY 
sorted unsorted 
part part 
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(ii) finally 


1 n 
. sorted 7 ® unsorted 
part part 


At some intermediate stage of the sorting we will have 


1 l n 
sorted 7 \ unsorted 
part part 


This picture corresponds to our loop invariant. It is true initially if 
it = 1. At the end of the loop, 2 = n implies that the whole array is 
sorted. So the purpose of the loop body will be to exchange the values 
of A in such a way that 7 can be increased until it equals n. Po is 
therefore refined as 
Py: 

i:=1  {A[1 -- i] sorted} 
do i#n- Extend sorted portion to include A[/ + 1] 
f:=i+1 
od 
e Extend sorted portion to include Afi + 1] 
is refined as 


a. t:=Alit+ 1] 
b. shift all elements of A[1 --- i] >t 
one place to the right such that 
A{i---j-1i]standA[j+1---f/+1]>t 
c. A[j]:=t 
e Line 0 of the above refinement is developed as 
jrsit1 {Aft .--- i441] >0 
do A[j — 1] > t— shift A[j — 1] to the right 
j:=j-1 
od 
On loop termination we have A[7 +1--21+1]>tandA[j-—1]=s 
t. This, along with the fact that at the start A[1 --- 1] was sorted, 
leads us to A[1 ---7—1] St. 
As we have not taken proper care of the end condition, the guard in 


the above loop will cause a subscript error when 7 = 1. So we modify 
it to 


j#1candA[j—1]>t 
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where cand is similar to and but the second operand is evaluated only 
if 7 ¥ 1 (C has a similar operator). This allows for a simple high level 
design. 

° shift A[7 — 1] to the right is refined as 


ALi) := ALi- 1] 
Collecting all the refinements we get 
i:=1 
doi4n—>t:=Alit+ 1], f:=i+ 1 
doj#1candA[j-—1]>t—-A[j]:=A[s/-1] 


Jj:=j-1 
od 
Ajj] :=t 
i:=1+ 1 


od 


We conclude this section with some comments on program mainte- 
nance and efficiency. Program maintenance, 1.e., program modification 
that is due to changing specifications or in response to design errors, 
should be carried out by making changes in the refinements and not 
just the final program. Making changes in only the final program 
renders the design (i.e., refinements) obsolete; consequently, an up- 
dated version of the design will no longer exist and subsequent program 
maintenance becomes increasingly difficult. 

When program specifications change, start from the initial refine- 
ment and locate the refinement affected. Modify this refinement and 
carry the effects of this change down to the last level of refinement. 

When a design error is detected, locate the most abstract refinement 
in which the design error was first made. Then carry the change caused 
by the removal of the design error down to the final refinement. 

A program that does not have the desired efficiency (i.e., perform- 
ance) characteristics must be redesigned. We locate the most abstract 
refinement R where a design decision was made that resulted in these 
characteristics. A proper design modification from refinement FR on- 
wards leads to the desired efficiency characteristics. Predecessors of 
refinement R remain unchanged. 


IV. RECURSION 


Stepwise refinement and recursion blend naturally with each other. 
Many programmers avoid recursion and treat it as a novelty.'? Some 
problems are best expressed recursively, even though languages like 
FORTRAN and COBOL are not recursive, and this inhibits programmers 
from thinking and designing recursively. Also, the examples of recur- 
sion in text books, e.g., factorial, Fibonacci numbers, etc., are not 
convincing about its utility. 
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Efficiency has often been cited as a reason against using recursion. 
In these days of increasing software costs and decreasing hardware 
costs, this reason is not very convincing. It is better to have a recursive 
design that is simpler, easier to understand, and easier to show correct 
than a corresponding nonrecursive version. If efficiency is still a 
criterion, then the recursive design can be systematically transformed 
into a nonrecursive one.'! The following examples illustrate the devel- 
opment of recursive programs: 

1. Write a procedure to print a binary tree with root R (Fig. 4). 
Each node is of the form 


where 


({) VALUE is the data at the node, 

(i) LEFT, RIGHT are the pointers to the subtrees, 

(zit) a NIL pointer value denotes the absence of a subtree. 
Initial refinement Po: Print binary tree with root R 


P,: 
ifR ANIL — Print binary tree with root LEFT(R) 
{left subtree} 
Print VALUE(R) 
Print binary tree with root RIGHT(R) 
{right subtree} 
[JR =NIL —- skip 
fi 


Fig. 4—A binary tree with root R. 
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The notation p(x) denotes the component p of the node pointed to by 
x. Writing the above in Pascal we get: 


procedure print (AR: f node); 
begin 
if R Anil 
then begin print(R t. LEFT); 
writeIn(R fT. VALUE); 
print(R t. RIGHT) 
end 
end 
2. This example illustrates a fast sorting technique called quicksort 
(Hoare’”). Array A, with bounds L and U (L s U), is to be sorted in 
nondecreasing order. 
Initial refinement Po: Quicksort(A, L, U). 
Py: 
if one element — skip 
[ ]two elements — order them 
[ Jmore thantwoelements — Partition A such that 
Lgjiuivu 


(at least one element per partition 
in this case) : 
where r is an arbitrary value 
Quicksort(A, L, 7) 
Quicksort(A, 2, U) 
fi 
P.: @ one element 
is refined as 
U-L=0 
° two elements is refined as U- L=1 
© order them 
is refined as 
if AU] <A[L] — swap(A[L], A[U]) 


[JA[UJ=A[L] —. skip 


fi 
e more than two elements 
is refined as 
UG =1n> 1 
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e Partition A such that --- 
is refined as 


r:i=A{(U+L)+ 2] 
P=Li j= U{A[L..i-1])srandA[j+1..U] =2r-— Invariant /} 
doi<j — _ Extend left partition by increasing / 
Extend right partition by decreasing / 
Rearrange elements so that invariant / is 
restored 
od | 
P3: e Extend left partition --- 
doAlij<r — i:=i+10d {A[/J2r} 
e Extend right partition --- 
doA[jJ>r > j:=j-—10d {A[j] sr} 
e Rearrange --- 
ifi<j — swap (A[/], AL /]) 
Pi=f+ jy i=s-1 
[j}i=s —- skip 
fi 


V. MULTIVERSION PROGRAMS 


A set of programs is said to constitute a program family if it is worth 
while to study programs from the set by first studying the common 
properties of the set and then determining the special properties of the 
individual family members. A typical family is the set of versions of an 
operating system distributed by a manufacturer.’ Such a family is also 
called a set of multiversion programs. Stepwise refinement enables 
multiversion programs to be developed conveniently and naturally. 

Multiversion programs may be built for the following reasons:'”"* 

(t) Economics. It is cheaper to build one program and then modify 
it to get another version than it is to build the second program from 
scratch. 

(11) Experimentation. Experimental prototypes may be built to study 
the feasibility of building a particular system. The experimental ver- 
sions along with the final program constitute the multiversions. 

(iit) Faulty program design. Another version of the program is built 
to correct the design faults of a prior version. 

Classically, multiversion programs have been built by first building 
one working version of a program. Another version is built by modify- 
ing this program and so on, as shown in Fig. 5. A set of multiversion 
programs produced as in Fig. 5 has one common ancestor. According 
to Parnas,"’ it is common for the descendants of one program to share 
some of their ancestors’ characteristics which are not appropriate to 
the descendants. In building the earlier version, some decisions were 
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INCOMPLETE 
PROGRAM 


X FINAL PROGRAM 


Fig. 5—Traditional way of building multiversion programs. 


made which would not have been made in the descendant programs 
had they been built independently. Removal of these decisions entails 
a lot of reprogramming. Consequently, programs have performance 
deficiencies because they contain decisions not really suitable for them. 
To build another program version, the program must first be complete 
and working. Relevant changes in an ancestor program that are not 
reflected in the descendant program cause maintenance problems. 

Stepwise refinement allows us to develop multiversion programs 
without the above problems. Never modify a complete program; always 
begin from one of the intermediate refinements which does not contain 
any design decisions unsuitable for the new version. This process is 
illustrated in Fig. 6. 


INITIAL REFINEMENT 


a REFINEMENTS 


X FINAL PROGRAM 





X 


Fig. 6—Multiversion program development by stepwise refinement. 
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All decisions made above a branch point are shared by the descen- 
dants. Refinements are developed so that common decisions of multi- 
version programs are above the branch point. A branch point results 
when two or more versions require different strategies (e.g., storage 
management techniques). 

Also refinements below a branch point may be carried out in 
parallel—we do not have to wait until a working program is available. 


VI. SUGGESTIONS FOR REFINEMENT 


1. Develop the program in a gradual seqitence of steps. 

2. In each step, refine one or more instructions of the given refine- 
ment. 

3. Terminate the refinement process when the instructions have 
been expressed in the desired programming language or when they can 
be mechanically translated to the programming language. 

4. Use information about the problem and its domain in the for- 
mulation of abstract instructions. 

d. Use notation natural to the problem domain. 

6. Make up abstract instructions as desired. However, they must 
eventually be translatable to an executable form. 

7. Make refinements reflect the instructions they represent in de- 
tailed form. 

8. Be aware that every refinement represents some implicit design 
decision and consider alternate solutions. Keep a written record of the 
major decisions made along with the refinements. 

9. Use recursion when appropriate. Even if the language does not 
support recursion, recursive solutions should still be considered. If 
recursive solutions are selected, they can be systematically converted 
to nonrecursive solutions. 

10. Use data refinement along with instruction refinement. 

11. Postpone representation of data as long as possible. This mini- 
mizes modifications to the design when an alternate representation is 
to be used. 

12. If an instruction appears more than once, use a procedure call; 
refine the instruction only once. Use procedure calls when they clarify 
program structure. 

13. Recognize abstract data types and separate their refinements 
from the rest of the program, i.e., do not refine the data type operations 
in line—use procedure calls. 

14. Try to use loop invariants to develop loops; they give a better 
idea of the instructions in the loop body and the guards. 

15. If a refinement solution does not turn out to be appropriate, 
repeat the refinement process using the additional knowledge derived 
from the previous attempt; stepwise refinement is an iterative process. 
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Vil. TOPICS RELATED TO STEPWISE REFINEMENT 


In this section, the idea of abstract data types is explained along 
with the concept of data refinement. This is followed by a discussion 
of the formal specifications of abstract data types. We then show how 
stepwise refinement may help simplify program correctness proofs. 
Finally, we briefly argue that stepwise refinement can be used in 
developing parallel programs. 


7.1 Abstract data types 


A data type is not only a set of values but also the operations that 
can be performed on them." A primitive data type is a data type that 
is available in the programming language. An abstract data type is a 
data type not available in the programming language and is imple- 
mented in terms of other abstract and primitive data types. 

The implementation of a data type consists of three parts—storage 
allocation, initialization, and the definition of operations. Consider the 
following implementation of the data type integer stack of size 100 in 
PL/I: 

(z) storage allocation 

DCL (S(100) 
NS /*NUMBER OF ELEMENTS IN S*/ 
)FIXED BIN; 
(zz) initialization 
NS = 0; 
(uz) Operations 


PUSH:PROCEDUREQ(, NA, X); 
DCL (A(100), NA, X) FIXED BIN; 
IF NA = 100 
THEN PUT SKIP LIST (‘OVERFLOW ERROR’); 
ELSE DO; NA = NA + 1; A(NA) = X; END; 
END PUSH; 


POP:PROCEDURE(A, NA); 
DCL (A(100), NA)FIXED BIN; 
IF NA =O 
THEN PUT SKIP LIST (‘UNDERFLOW ERROR ); 
ELSE NA = NA — 1; 
END POP; 


TOP:PROCEDURE(A, NA) RETURNS (FIXED BIN); 
DCL (A(100), NA)FIXED BIN; 
IF NA =O 
THEN PUT SKIP LIST(‘ERROR-STACK EMPTY’); 
ELSE RETURN(A(NA)); 
END TOP; 
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EMPTY: PROCEDURE(A, NA)RETURNS(BIT(1)); 
DCL (A(100), NA)FIXED BIN; 
RETURN(NA = Q); 

END EMPTY; 


Such implementations of abstract data types suffer from many 
disadvantages. Representation details are not hidden from the pro- 
grammer. This leads to the following: 

(1) Representation dependent programming, perhaps for “effi- 
ciency” reasons. For example, the programmer may directly inspect 
the top of the stack instead of using the procedure TOP; a change in 
the representation will then require changes in the program. 

(iz) Violation of the specifications of the abstract data type. For 
example, in case of the stack the programmer may delete an element 
in the middle of the stack. If such an ability is desired, then the 
specifications should be changed. 

(zit) Inadvertent or malicious violation of the integrity of the ab- 
stract data type. For example, NS could be set to 0 even if the stack is 
not empty. 

In addition, the programmer has to be concerned about which 
components of the representation have to be passed as arguments to 
the procedures associated wth the abstract data type. To be uniform, 
we have passed all the components of the stack representation for 
every operation. 

Modern programming languages such as cLU'*'® and Alphard' pro- 
vide data abstraction features, called clusters and forms, respectively, 
that remove the above problems. In addition, they support data 
refinement. We use CLU’s clusters to illustrate data refinement and 
give an example of programming with abstract data types. The nota- 
tion used is similar to that of CLU. 


stack = cluster is push, pop, top, empty; 
rep = record [ns:integer; 
S: array[1 -- 100] of integer] 
create = oper( ) returns cvt; 
s:rep 
S.ns:=0 
return s 
end 
push = oper(a:cvt, x:integer) 
if a.ns = 100 
then overflow error 
else begin a.ns := a.ns + 1 
a.s[a.ns] := x 
end 
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end 
pop = oper(a:cvt) 
ifa.ns =0 
then underflow error 
else a.ns :=a.ns — 1 
end 
top = oper(a:cvt) returns integer 
ifa.ns =O 
then stack empty error 
else return a.s.[a.ns] 
end 
empty = oper(a:cvt) returns boolean 
return a.ns =O 
end 
end stack 


Having defined the stack cluster, the programmer can declare vari- 
ables of type stack, e.g., b:stack. 

The first line of the stack definition states that the operations 
available to stack users are push, pop, top, and empty. The operation 
must be prefixed by the type name, e.g., the special operation create 
is automatically executed when a variable of type stack is declared. 

The line beginning “rep =” specifies that a stack is represented by 
an integer array and an integer. This information cannot be used 
outside the cluster, thus making the rest of the program representation 
independent. 

The special symbol evt (convert) means that the variable is of the 
abstract type outside the operation and of the representing type inside 
the operation. 

An operating system example 

Suppose we are writing an operating system. Jobs are to be sched- 
uled according to their priority (10 being the highest and 1 the lowest). 
The next job to be executed is the one with the highest priority. If 
there is more than one job with the highest priority, then the one 
selected for execution is the one that waited the most (FIFO): 

begin {operating system} 


Add job j with priority p to the list of 
jobs waiting for execution 


Wait until there is a job to execute 
let / be the next job to be executed 


end {operating system} 
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To implement the job scheduling we define a cluster called spq (system 
of priority queues), with operations add, empty , and next job. 


spq = cluster is add, empty, nextjob 
rep = array[1 -- 10] of queue 
create = oper( ) returns cvt 
s:rep 
return s 
end 
add = oper(s:cvt, job:integer, prty:1 -- 10) 
queue$add(s[prty], job#) 
end 
empty = oper(s:cvt) returns boolean 
i: integer := O 
while / ~ 10 do 
begin /:=/+ 1 
if queueSempty(s[/]) then return false 
end 
return true 
end 


nextjob = oper(s:cvt) returns integer 


i:integer := 11 
j:integer 
while / ~ 1 do 


begin /:=/-—1 
if ~ queueSempty(s[/]) 
then begin j := queue$front(s[/]) 
queueS$delete(s[/]) 
return / 
end 
end 
end 
end spq 


The cluster spq is implemented in terms of the abstract data type 
queue. For example, cluster spq’s operation add uses cluster queue’s 
operation add, i.e., queue$add. We refine instructions of the abstract 
data type queue by implementing a queue cluster. For this example, 
we assume that no more than 50 jobs of the same priority will be 
waiting at the same time. We shall use a wrap-around array represen- 
tation for the queue in which 

(t) an array of size 51 is used; only 50 elements can be stored in the 
queue, 

(ui) F = L means that the queue is empty, 
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(zzz) mod(F, 51) + L points to the next element in the queue, 
(iv) mod(L, 51) points to the last element in the queue, 
(v) mod(L, 51) + 1 = F means that the queue is filled. 


queue = cluster is delete, add, front, empty; 
rep = record[a:array[1 -- 51] of integer; F, L:integer] 
create = oper( )returns cvt 


q:rep 
q.F :=0 
qg.L :=0 
end 


delete = oper(q:cvt); 
ifg.F =q.L 
then error-empty queue 
else g.F := mod(q.F, 51) +L 
end 
add = oper(q:cvt, j:integer) 
if mod(g.L, 51) +1=q.F 
then error-queue full 
else begin g.L = mod(q.L, 51) + 1 
q.a{q.L|:=j 
end 
end 


front = oper(q:cvt) returns integer 
ifg.F =q.L 
then error-empty queue | 
else return qg-a[mod(F, 51) + 1] 
end 


empty = oper(q:cvt) returns boolean 
return g.Ff =q.L 
end 

end queue 


7.2 Formal specifications of abstract data types 

In this section, we consider the formal specifications of abstract data 
types. In particular, we take a brief look at the algebraic specification 
technique proposed by Guttag.'’~” Abstract data types are imple- 
mented in terms of other data types by the refinement or decomposi- 
tion of specifications. 

The algebraic specifications consists of two parts: 

(1) the syntax—here the operations of the data type are listed 
indicating the number of arguments, the argument types, and the 
result type. 
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(zz) the semantics—here axioms are given that relate the values 

created by the operations. 

~ In the basic notation which we will use, the operations are functions 
without side effects; none of the arguments are changed. Guttag”' has 
extended the notation to allow for changes in arguments, i.e., to allow 
procedures. 

We shall present algebraic specifications for the abstract data type 
stack considered earlier. We shall then give specifications for an array. 
Finally we shall refine the stack operations in terms of array operations. 
To keep the specifications as simple as possible, we shall consider 
unbounded (i.e., infinite size) stacks and arrays. 

Stack specifications: 


1. type stack 

2. syntax 

3: create( ) —_ stack 

4, push(stack, integer) — stack 
5: pop(stack) — stack 

6. top(stack) — integer 

7. empty(stack) — boolean 

8. semantics 

9, declare s:stack; x:integer 

10. pop(create( )) = underflow error 
11. pop(push(s, x)) = s 

12. top(create( )) = empty stack error 
13. top(push(s, x)) = x 

14. empty(create( )) = true 

15. empty(push(s, x)) = false 


Line 3: specifies the syntax of the create operation. The result of 
calling create, which has no parameters, is an object of type 
stack. 

Line 4: operation push takes as input a parameter of type stack and 
a parameter of type integer. It returns a value of type stack. 

Line 10: The result of applying the pop operation on a stack that has 
just been created is an underflow error. ‘=’ is the equality 
operator (not assignment). 

Line 11: The result of popping a stack s on which the last operation 
was to push a value x is the initial stack s. 

These specifications specify the abstract data type completely. For 
details on how to construct the specifications, see Ref. 20. We now 
give the specification for the type array which will be used to imple- 
ment the type stack. 


1, type array 
2. syntax. 
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9 createarray( ) — = array 
4. assign(array, integer, integer) — array 
5. access(array, integer) — integer 
6. semantics 
7. declare a:array; /, j, x:integer 
8. access(createarray( ), /) = undefined error 
9, access(assign(a, /, x), /) 
=ifi=/; 
then x 


else access(qa, /) 


Operation assign(a, 1, x) stands for an array whose 7th element has 
been assigned the value x. Operation access(a, 1) stands for the ith 
element of a. 

We now implement the abstract data type stack by decomposing 
stack operations in terms of array operations. We give axioms called 
programs that give the effect of stack operations in terms of array 
operations. 


1. stack implementation 

2. syntax 

3: STK(array, integer) — stack 

4. programs 

5. declare a:array, t, x:integer 

6. create( ) = STK(createarray(_ ), 0) 

rie push(STK(a, t), x) = STK(assign(a, t + 1, x), 
t+ 1) 

8. pop(STK(a, t)) =if t=O 7 


then underflow error 
else STK(a, t — 1) 
9. top(STK(a, t)) = if f= 0 
then empty stack error 
else access(a, ft) 
10. empty(STK(a, t)) = (t = 0) 


Given the formal specifications for an abstract data type, an initial 
inefficient implementation, called the direct implementation, can be 
automatically generated.'” Thus one can test some facets of a high- 
level data type before fixing upon a particular implementation. Thus 
a true top-down implementation methodology can be achieved. 


7.3 Program correctness 


Stepwise refinement provides a natural environment for reducing 
the problem of showing the correctness of a large program into showing 
the correctness of several smaller programs. 

A program is said to be correct if it meets its input and output 
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specifications (which may include performance criteria). Alternately, 
a correct program is one that transforms a state (1.e., data values) 
representing the input specifications into one representing the output 
specifications. A program is thus viewed as a specification transformer 
(Dijkstra® calls it a predicate transformer). Let 

(1) J and O be the input and output specifications for the problem 
being solved. 

(tt) Po be the initial problem formulation and P¢ the corresponding 
final program. 

Then we say that Py has been solved correctly if P¢ is correct with 
respect to J and O. If Po is a nontrivial problem, then proving the 
correctness of P¢ will be correspondingly nontrivial. 

Stepwise refinement allows the correctness proof of a program to be 
reduced to the correctness proofs of smaller programs. Suppose Pp is 
refined or decomposed into the subproblems Pio, Pi, «++, Pin, with 
each P); having specifications S; and S;+, (So = J and S,4; = 0). The 
problem of proving P¢é correct 1s now reduced to proving Pj; (0 <i 
n) correct, where Pj; is the program corresponding to P,;. Owicki”’ 
provides an example of such a correctness proof. 

In summary, stepwise refinement provides a natural medium for a 
difficult proof to be decomposed into several smaller proofs. A proof is 
any convincing demonstration of a program’s correctness. However, 
the conventional approach to understanding programs in terms of how 
computers execute them 1s inadequate. A more mathematical approach 
is needed even if it is used informally.~’ Alagic™* contains many exam- 
ples of programs designed with correctness proofs in mind. 


7.4 Parallel programs 


The development of parallel programs is no different than the 
development of sequential programs as far as stepwise refinement is 
concerned. Instead of using only sequential constructs, like begin S;, 
Se; +++; s, end in Pascal, we now use constructs for parallel program- 


ming,”” °° as shown below: 
(t) cobegin S,, So,..., S, coend 
The statements S,, S2,..., S, are executed in parallel. 
(iz) whenb, —- SL, 
[ |be —_ SLs 
[]b, — SL, 
end 


Wait till one of the guards b; is true and then execute the 
corresponding statement list 
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(tiz) cycle b, —- SL, 


[ |b2 = SL>o 
(lb, — SL, 
end 


Endless repetition of a when statement. 
If several guards are true within a when or a cycle statement, then 
one of the corresponding statement lists is executed nondeterministi- 
cally. 


Vill. CONCLUSIONS 


In this tutorial, we have tried to illustrate the stepwise refinement 
technique, its advantages, and related topics. Stepwise refinement can 
be learned easily with some practice. It blends in naturally with the 
newer concepts in programming languages and methodology (e.g., 
abstract data types, parallel programming, etc.). 

Stepwise refinement does not provide a solution to the problem. No 
methodology, old or new, is going to discover algorithms (1.e., problem 
solutions) for the programmer. The algorithms must come from the 
programmer’s education, experience, and ingenuity. 

Stepwise refinement encourages the development of a problem 
solution in a systematic fashion that is easy to understand, modify, 
and improve upon. The various refinements should not be discarded 
once the final program version is arrived at. They are part of the 
program documentation. Understanding the final program without 
them is hard even if the program is small (e.g., the eight-line final 
program version of insertion sort in Section ITI). 

The reader is urged to try stepwise refinement on some problems, 
especially large ones. 
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A New Code for Transmission of Ordered 
Dithered Pictures 


By O. JOHNSEN 
(Manuscript received September 22, 1980) 


This paper presents a new predictor for coding dithered pictures. 
A dithered picture, a two-tone picture which gives the illusion of a 
picture with many shades of grey, is obtained by comparing the grey- 
level picture with a position-dependent threshold. When the intensity 
of a picture element (pel) exceeds the threshold, it is classified as 
white; otherwise it is black. In the new prediction scheme, the color 
of a pel ts predicted from pels having nearly the same threshold level. 
Therefore, the position of the pels used for prediction varies according 
to the threshold level. Computer simulations show that prediction 
errors are reduced by 50 percent for certain classes of originals, and 
the entropy is reduced by 20 percent compared to the result obtained 
with a previous predictor. An advantage of this new prediction 
scheme is that it appears to be less sensitive to picture content. 


l. INTRODUCTION 


Dithering is an image processing technique that creates a two-level 
picture that gives the illusion of a multilevel picture by appropriately 
arranging the spatial density of the two levels (usually black and white) 
on the picture.” The dithering technique consists of comparing a 
multilevel image with a position-dependent threshold and setting pels 
to white when the input signal exceeds the threshold. Other pels are 
set to black. The matrix of threshold values (called the dither matrix) 
is repeated over the entire picture to provide the threshold pattern for 
the whole image. | | 

The merit of a dither matrix is judged from the quality of its 
rendition of the original picture. A class of dither matrices of special 
interest is the ordered dither matrices,’® which use a simple recursion 
algorithm to create dither matrices of size 2” x 2” to simulate 27" + 1 
brightness levels. 
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In the case of the 4 X 4 matrix, the 16 threshold levels are put in the 
following positions: 


0 8 2 10 
2 4 14 6 
Sat a or ”) 
16 7 13 5 


The ordered dither matrices have the advantage of rendering grey 
level pictures with as good a subjective quality as with empirical dither 
matrices.’ Moreover, we show that ordered dither matrices are espe- 
cially convenient for the proposed new predictor. Figure 1 shows three 
dithered pictures, “Karen,” “Engineer drawing,” and “House,” that 
are used for computer simulations. The originals for these pictures are 
10 cm by 10 cm and were scanned to generate an array of 512 by 512 
pels. The pels were digitized with a uniform 8-bit PCM code (256 levels). 
Matrix 1 then becomes 3 


8 136 40 168 
200 72 232 104 
56 184 24 152|” 2) 
248 120 216 88 


Note that Figs. la, b, and c were made from digitizing the same 
originals as used in Refs. 4 and 6 but not from the same digitized 
versions. 

Efficiently coding the bits of the dithered image reduces the data 
rate. If the pels of the dithered picture are sampled in a typical raster 
scan fashion, the frequent alternations between black and white pels 
prevent the direct use of run-length coding, which is an efficient 
redundancy reduction scheme.* Others have proposed a modification 
to the straightforward raster scan sampling or to the direct use of the 
picture pels. Judice proposed a different sampling order, a bit inter- 
leaving code.* Netravali et al.° devised a predictive coding technique 
that is an extension of their two-level facsimile coding scheme.’ In 
both cases, the binary picture containing the interleaved bits or the 
prediction errors can be efficiently run-length coded. In Netravali’s 
coding technique, the prediction of the value of a pel (1+ white, 0= 
black) 1s made dependent on the values of four neighbor pels as well 
as to the threshold level. Figure 2 shows the ordered dither matrix of 
(2) and the prediction that he used. With a dither matrix of 16 
threshold levels, and a prediction from 4 neighboring pels, the predic- 
tion table has 256 possible states. This prediction scheme takes advan- 
tage of the correlation between adjacent pels, but it can only partially 
exploit the much stronger correlation that exists between neighboring 
pels with the same or similar threshold level. 

The scheme proposed in this paper predicts a pel according to the 
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Fig. la—Dithered test picture of Karen. 


value of nearby pels of the same or similar threshold level. The new 
predictor gives for two of the three test pictures 50 percent fewer 


prediction errors compared to the previous one, and 10 percent fewer 
prediction errors for the third test picture. 


ll. PREDICTION PRINCIPLES AND ALGORITHMS 
2.1 Prediction principle 


A pel can best be predicted from the following information: 

(a) the threshold level of the pel itself, 

(b) the values of the nearest (already coded) pels with similar 
threshold levels. 

We use four previous pels for prediction and consider pictures which 
have been dithered using the 4 x 4 ordered dither matrix of (2). There 
are then 16 threshold levels and since each pel used for prediction has 
only two possible values, 256 states are defined by all the possible 
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Fig. lb—Dithered test picture of Engineering drawing. 


combinations of the predictor and of the threshold level. The predictor 
code book contains 256 bits, each indicating whether the pel is pre- 
dicted to be black or white. The code book is simply obtained by 
statistical measurement. From these predictions, we generate a new 
picture where the pels correctly predicted are put to “0” while the pels 
incorrectly predicted are put to “1.” The new picture, called the “error 
picture,” can easily be run-length coded and transmitted efficiently. 
Knowledge of the predictor code book allows the receiver to recon- 
struct the original dithered picture. The four pels used for prediction 
are: | 

(a) the two nearest pels with the same threshold level, 

(6) the nearest pel with the next higher threshold level, 

(c) the nearest pel with the next lower threshold level. 
For cases (6) and (c), when two pels satisfy the rule at the same time, 
the pel with the smallest correlation with the other prediction pels is 
chosen. Also, for case (6), if there are no pels with a higher threshold 
level, a second pel with the next lower threshold level is used for 
prediction. A similar situation can appear in case (c). 


382 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1981 


2.2 Prediction algorithm 


Let the pel to be predicted, S;;, have position (7; 7), where 1 is the 
line number, 7 is the position on the line, and z and 7 are increasing 
down and to the right, respectively. The two nearest pels with the 
same threshold level naturally have coordinates (i — 4; 7) and (1; 7 — 
4) since a4 X 4 dither matrix is used. 

To find the two other pels used for prediction, we must look at the 
structure of the 4 < 4 ordered dither matrix. The procedure is described 
in the appendix. We give here only the result, showing that the 
coordinates of the four pels used for the prediction are 


l uy— 4, 

2. 1-4 y, 

3. L—2:7 + 2, 
4. variable. 


The position of the prediction pel with variable position is given 
according to the position of Sij within the dither matrix by 
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Fig. lc—Dithered test picture of House. 
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Fig. 2—Picture elements used for prediction, where Sy is the picture element to be 
predicted, and W, X, Y, and Z are picture elements used to predict Si. 





Lege 2. 2a yy bey EZ 
t—-37+1 i-37y-1 i-27 t-2;7 
t-ljy+l i-lyy-1l i-27 1-237 (3) 
\i-2 j-2 i— 337 1—-2;7 1-237 


Figure 3 shows the positions of the four pels used for the prediction. 

Note that for the eight pels in the two right rows of the matrix the 
prediction pel with variable position is the same. 

This new predictor can be called a position-dependent predictor 
since one of the pels used for the prediction is variable. In the case 
where using a position-dependent predictor is undesirable, the predic- 
tor size can be reduced and only three pels can be used, or the fourth 
pel used in the prediction can be fixed to the most-used position (1 — 
2; 7). Both cases lead to a slight increase in the number of prediction 
errors. 


2.3 Extension to other dither matrices 

The same prediction principle can be applied to different dither- 
matrices or different sizes of matrices, but with care. For example, 
when applying the same rule to an ordered 8 X 8 dither matrix, three 
of the four pels of the predictor are fixed and their locations, compared 
to the pel to be predicted, are in the same direction but twice as far as 





Fig. 3—Position of the pels used for the prediction, where Sy is the pel to be predicted, 
A, B, and C are the three fixed pels used for predicting, and the D’s are the different 
positions occupied by the pel with variable position used for prediction. 
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fora 4 X 4 matrix. Their locations are 


l. uy 8, 
2. t— 8, 
3. 1-457 +4. 


The prediction pel with variable position again has the same relative 
position for the right half of the matrix, the coordinates being (1 — 4; 
J). For the other half, the position is variable, but often the same 
within each 2 X 2 submatrix. The position matrix of the prediction pel 
with variable position can be constructed by looking at the structure 
of the 2 X 2 and 4 X 4 submatrices. 

It is questionable whether such a predictor gives good prediction. 
The prediction is now made according to the value of the pels twice as 
far away, aS in the case of a 4 X 4 dither matrix. The error rate is 
therefore likely to increase. The best solution is to use the same 
prediction algorithm as for the 4 xX 4 dither matrix, since the 8 X 8 
dither matrix is composed of four 4 x 4 dither matrices whose threshold 
levels are only very slightly different. The number of prediction errors 
would then be only slightly higher than if the picture was dithered 
with a 4 X 4 matrix. A slight decrease in prediction errors can be 
obtained by considering all 64 threshold levels instead of 16 for 
constructing the prediction table. The prediction table would then 
contain 1024 states instead of 256. 

In the case of other dither matrices (nonordered) the prediction 
principle is the same but the algorithm is different, since the position 
of the nearest pels with similar threshold level changes. In most cases, 
two of the four pels used for prediction would have a variable position, 
thus showing the advantage of using the ordered dither matrices with 
this prediction scheme. 

This prediction scheme can be extended by adding more pels to the 
predictor; for example, using pels closer to the pel to be predicted, 
thereby decreasing the number of prediction errors along a sharp edge. 


lll. SIMULATION RESULTS 


Computer simulations are performed using the three dithered pic- 
tures of Fig. 1. The results are compared to those obtained using the 
technique proposed by Netravali et al.° Two measures of performances 
are made: counting the number of prediction errors and measuring the 
entropy of the run-length statistics of the picture containing the 
prediction errors. The entropy is converted into bits per pel. The run- 
length entropy measured is the classical run-length entropy, given for 
example in Ref. 6. 

Table I gives the number of prediction errors with the new predictor. 
For comparison, the number of prediction errors with the technique in 
Ref. 6 is also given. Compared with the predictor of Ref. 6, the number 
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of prediction errors is reduced by 53 percent for “Karen” and “House” 
and 10 percent for “Engineering drawing.” 

Table II gives the entropy comparisons. Four entropies are given. 
The first is the entropy of the run-length statistics of the prediction 
error when using separate prediction tables for each picture. The 
second is the same entropy in the case where a single prediction table 
is used for all three pictures (the prediction table is optimized to the 
sum of the three pictures). 

The third entropy is the entropy of the run-length statistics of the 
prediction errors when the prediction errors are ordered according to 
the probability of error® (good-bad ordering). In the good-bad ordering 
the prediction errors of the pels of a line are filled in the right side of 
a line if their error probabilities are high, while they are put on the left 
side when their error probabilities are low.° The goodness threshold 
used is 0.05. 

The fourth entropy is the same as the latter but with a single 
prediction table for all three pictures. 

The entropy measurements show the great improvements obtained 
by this new prediction technique compared to the results from Ref. 6. 
For “Karen” and “House” the reduction in the run-length entropy is 
25 to 30 percent, but it is limited to 3 percent for “Engineering 
drawing.” When a single prediction table is used, the reduction in 
entropy is 28 percent for “Karen” and “House” but it reaches 10 
percent for “Engineering drawing.” In the case of good-bad ordering, 
the decrease in entropy with the new prediction technique compared 
to Ref. 6 is 17 to 24 percent for “Karen” and “House” while an increase 
of 2 percent appears for “Engineering drawing.” When a single predic- 
tion table is used for all pictures, the decrease is 24 percent and 17 
percent for “Karen” and “House” and 3 percent for “Engineering 
drawing.” 

The gain obtained with the new predictor for “Karen” and “House” 
is very different for “Engineering drawing.” It can be explained by the 
picture characteristics. The “Karen” and “House” pictures are really 
half-tone pictures with mostly gradual changes in brightness while 
“Engineering drawing” is a graphical picture containing mostly steep 


Table |—Comparison of the number of 
prediction errors 


Prediction 
Prediction Error with 
Errors with Predictor of 
Pictures New Predictor Netravali 
Karen 12,613 26,007 
Engineering drawing 25,742 28,526 
House 12,106 26,156 
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Table I|—Comparison of entropies (in bits/pel) 


Engineering 
Karen Drawing House 
I. New Predictor | 
Run-length coding with separate pre- 0.221 0.396 0.214 
diction table for each picture 
Run-length coding with a single pre- 0.228 0.403 0.216 
diction table 
Run-length coding with good-bad or- 0.196 0.376 0.187 


dering with separate prediction 
table for each picture 
Run-length coding with good-bad or- 0.202 0.380 0.191 
dering and a single prediction table 
II. Predictor of Netravali 


Run-length coding with separate pre- 0.317 0.409 0.286 
diction table for each picture 

Run-length coding with a single pre- 0.317 0.446 0.303 
diction table 

Run-length coding with good-bad or- 0.259 0.367 0.225 


dering and separate prediction 
table for each picture 

Run-length coding with good-bad or- 0.278 0.393 0.253 
dering and a single prediction table 


changes in grey level. These steep changes are well predicted with 
Netravali’s predictor and therefore practically no improvements are 
obtained with the new predictor. | 

An advantage of this new technique is that the predictor is quite 
independent of the characteristics of individual pictures. The entropy 
with the predictor optimized for the sum of the three pictures is less 
than 2 percent higher than with individual predictors for each picture. 
With Netravali’s predictor the entropy is sometimes 10 percent higher 
when the predictor is optimized for the sum of the three pictures. 

The advantage of using a good-bad ordering is smaller with the new 
predictor, the decrease in entropy being about 10 percent. We empha- 
size however, that the good-bad ordering leads to a nearly monotoni- 
cally decreasing run-length distribution and therefore the runs are 
easy to code, while without ordering, the run-length distribution 1s 
very irregular. Straightforward run-length codes can be devised which 
code the pictures obtained after prediction and reordering with an 
efficiency of about 90 percent. 


IV. SUMMARY AND CONCLUSION 


We have described a new predictive coding scheme for dithered 
pictures where the pels used for prediction have variable positions. 
The prediction is made according to the value of nearby pels that have 
nearly the same threshold level and according to the value of the 
threshold of the pel to be coded. We obtain a decrease of 20 to 30 
percent in the run-length entropy compared to an earlier prediction 
scheme in the case of dithered pictures of natural scenes. The bit rate 
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is reduced to 0.18 to 0.23 bits/pel. We obtain a slightly higher entropy 
when we fix the position of the pels used for prediction. In the case of 
graphical pictures, the entropy is about the same as before. The new 
predictor has the advantage of being nearly independent of the char- 
acteristics of individual pictures. In the case of a coding system, the 
run-length distributions must be used to propose a code whose perfor- 
mances should be near those given by the entropy measures. This 
prediction scheme was conceived for images dithered with a 4 X 4 
ordered dithered matrix, but can be easily applied to other dither 
matrices. We envision extensions that would improve the performance 
in regions that have sharp changes in brightness. 


APPENDIX 
Determination of the Four Neighbor Pels used for Prediction 


The two nearest pels with the same threshold level naturally have 
coordinates (1 — 4; 7) and (1; 7 — 4) since a 4 X 4 dither matrix is used. 

To find the two other pels used for prediction, we examine the 
structure of the 4 X 4 ordered dither matrix. Let us number the four 
2 X 2 matrices contained in the 4 X 4 dither matrix as 0, 1, 2, and 3 
according to the increase in their average threshold value. Figure 4 
shows their relative positions. We must also remember that the 4 x 4 
matrix is surrounded by identical matrices whose pels can also be used 
as prediction pels. 

For a pel in submatrix 1, the neighboring pel with the next-higher 
threshold level will be in the same relative position in submatrix 2 with 
coordinates (z — 2; 7). The neighbor with the next-lower threshold 
level will be in the same relative position in submatrix 0. That neighbor 
can be either in submatrix 0 of the same 4 X 4 matrix or in submatrix 
0 of the matrix adjacent to the right. Both cases lead to the same 
distance from the neighbor to the pel to be predicted, but the latter 
case leads to the smallest correlation with the other prediction pels 
and is therefore chosen. That prediction pel has position (1 — 2; 7 + 2). 

Similarly, if the pel to be predicted is in submatrix 2, the two pels 


ee pee 
a a ar se a aa ae ara Foray ia ee re 
; 8 136 | ; 40 168 | 
| i 
| 2000 72. p22 104 | 
| 56 1844 | 2 4162] 
| | { 
| | | | 
fe ett et 120 | ,2i6 88 


Fig. 4—4 x 4 dither matrix with the numbering of its 4 submatrices 0, 1, 2, and 3. 
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with the next threshold level chosen for the prediction have coordi- 
nates (i — 2;7) and (z — 2,7 + 2). 

For a pel in submatrix 3, the pel with the next-lower threshold level 
will be in the same relative position in submatrix 2. As earlier, there 
are two possibilities, but correlation comparisons leads us to choose 
the pel with coordinate (1 — 2; 7 + 2). The neighbor with the next- 
higher threshold level will be in submatrix 0, but in each case in a 
different relative position, as can be verified from Fig. 4. For the pel 
with threshold level 248, pels with a higher threshold level do not exist. 
Therefore, the two nearest pels with the next lower-threshold level are 
chosen as prediction pels. 

Similarly, if the pel to be predicted is in submatrix 0, the pel with 
the next-higher threshold level chosen for prediction has coordinates 
(t — 2,7 + 2), while the pel with the next-lower threshold level has in 
each case a different relative position. In this case, when the pel with 
threshold level 8 is predicted, the two nearest pels with the next-higher 
threshold level are chosen as prediction pels. 

In summary, the positions of the pels used in the prediction are: 


l uy—4, 
2, 1-4, 
3. L—2;7 + 2, 


4. variables. 


The position of the prediction pel with variable position 1s given 
according to the position of Siyj within the dither matrix by 


i-2%j7-2, i-lkj i-2j i-27 
i-3;j7+1 i-3j-1 i-2;7 1-237 
i-lyjtl1 i-lyyj-1 i-27 t-237 
i—-27-2 i-37 =o PS 27 
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An Extension of the CCITT Facsimile Codes for 
Dithered Pictures 


By O. JOHNSEN and A. N. NETRAVALI 
(Manuscript received September 24, 1980) 


The International Telegraph and Telephone Consultative Com- 
mittee (CCITT) has recently recommended two redundancy reduction 
codes for the digital transmission of two-level facsimile. It is of 
interest to transmit other types of still pictures using these codes. In 
this paper we develop a method for bilevel dithered pictures, wherein 
pictures are preprocessed reversibly to permit efficient coding by the 
CCITT codes. The preprocessing consists of local rearrangement of 
picture elements to obtain large contiguous black and white areas. 
The main advantage of these schemes over the existing ones is that 
they require the addition of a simple processor to a standardized 
facsimile coder. One of the schemes has the additional advantage 
that even in the absence of the postprocessor at the receiver, a 
distorted but recognizable picture is obtained. A compression ratio of 
1.5 to 3.5 ts obtained with these methods, which is comparable with 
other coding schemes. 


l. INTRODUCTION 


The International Telegraph and Telephone Consultative Commit- 
tee (CCITT) has recently recommended two redundancy reduction 
codes for the digital transmission of two-level facsimile over the general 
switched telephone networks.’ In the future, most facsimile apparatus 
using redundancy reduction techniques will use these codes. The first 
code is a one-dimensional run length code called the modified Huffman 
code (MHC). The second code is a two-dimensional extension of the 
modified Huffman code, called the modified READ code (MRC). 

Dithering 1s an image processing technique which creates a two- 
level picture that gives the illusion of a multilevel picture by appro- 
priately controlling the spatial density of black and white picture 
elements.”° It is useful in systems that use inherently bilevel displays 
(e.g., plasma panels). Dithering allows a picture to be transmitted as 
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a two-level picture, thus greatly reducing the number of bits to be 
transmitted. 

The high frequency components of a dithered picture prevent the 
use of CCITT codes as an efficient redundancy reduction scheme.° 
Judice,’ Netravali et al.,’ and Johnsen® have proposed several coding 
techniques for dithered pictures. Judice’ has proposed a bit interleav- 
ing scheme which regroups pels with the same or similar threshold 
levels, thus allowing the use of run-length coding. 

We are interested in coding schemes for dithered pictures that would 
require only a slight modification to the standardized one-dimensional 
and two-dimensional CCITT codes. Our aim is to include only a simple 
preprocessor at the transmitter and a postprocessor at the receiver. 
The preprocessor would include a dithering processor which trans- 
forms the analog signal from the facsimile scanner into the binary data 
stream corresponding to the dithered image and a precoder which 
modifies the dithered image in such a way that it can be efficiently 
coded by the one-dimensional or two-dimensional codes. We only 
consider reversible transformations. The postprocessor of the receiver 
must make the inverse transformation. One of the preprocessing 
schemes that we propose has an interesting property that the ordered 
picture is visually similar to the original dithered picture. Therefore, 
a facsimile receiver not containing the postprocessor will be able to 
reproduce a picture that is in most cases recognizable. 


Il. ORDERED DITHER 


The dithering technique consists of comparing a multilevel image 
with a position dependent threshold and turning only those picture 
elements “on” (or “1”) where the input signal exceeds the threshold 
value. The square matrix of threshold values (called the “dither 
matrix”) is periodically repeated over the entire picture to provide the 
threshold pattern for the whole image. 

Several dither matrices can be used. They have been judged on the 
basis of subjective fidelity of reproduction. A class of dither matrices 
of special interest are the “ordered dither matrices.”* A dither matrix 
of size n X n simulates n? + 1 brightness levels where n is a power of 
2. Thus, 17 brightness levels can be simulated with a 4 x 4 ordered 
dither matrix and 65 brightness levels with a 8 X 8 matrix. A 4 x 4 
matrix, with 16 threshold levels, is shown in eq. (1), 


0 8 2 10 
2 414 6 
3111 9 (1) 


1 7 13 = «5 


Figure 1 shows dithered pictures, “Karen,” “Engineering drawing,” 
and “House,” used for computer simulations. These pictures are 10 cm 
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Fig. 1la—Dithered test picture of Karen. 


by 10 cm and are scanned to generate an array of 512 by 512. The 
picture intensity is linearly quantized to 8 bits (256 levels) and then 
dithered. The following dither matrix, obtained from eq. (1) by ex- 
panding the threshold levels to cover the entire span of picture inten- 
sity, 1s used: 


| 8 136 40 168 
200 72 232 104 (2) 

56 184 24 152 

248 120 216 88 


We mention that the three pictures of Fig. 1 are made from the same 
originals as used in Refs. 4 and 6, but they are not the same digitized 
versions. 


lll. REORDERED PEL CODING 


Since the codes to be used are already known, the goal of the 
ordering is to transform the picture to minimize the coding length. In 
the one-dimensional case, the runs must be as long as possible. In the 
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Fig. Ib—Dithered test picture of Engineering drawing. 


two-dimensional case, the vertical correlation between runs must also 
be maximized. 

The pel reordering is related to the bit interleaving method proposed 
by Judice.° The main difference is that now the reordering of the pels 
is done locally. The ordering is made within a few 4 X 4 blocks at a 
time, typically 1 to 4 blocks. The difference can be seen from the 
relation between neighboring pels of the ordered pictures. In Judice’s 
schemes, neighboring pels are pels with similar threshold levels of 
adjacent blocks, while in the local ordering scheme, neighboring pels 
are pels of nearly the same threshold level of the same or neighboring 
blocks. Therefore, bit interleaving exploits better the correlation 
among pels with similar threshold levels, while pel ordering exploits 
better the correlation among the neighboring pels. 

We propose two closely related kinds of pel reordering. One of them 
is optimized for run-length coding, while the other is optimized for the 
two-dimensional codes. They have both been conceived for 4 X 4 
ordered dither matrices, but in the case of larger dither matrices, 
simple modifications can be made. 
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Fig. l1c—Dithered test picture of House. 


3.1 Pel reordering for run-length coding 


Let us consider the 4 X 4 dither matrix of eq. (1). To have as few 
runs as possible, the pels with nearly the same threshold levels should 
be contiguous on the same line. One way to reduce the number of runs, 


is to reorder the dithered pels for two consecutive blocks slightly 
differently, for example, 


Block A Block B 
QO 4 2 8 3. 2 1 O 
4 5 6 7 7 6 5 4 (3) 
8 9 10 I1 11 10 9 8 
12 138 14 15 15 14 13 12 


Another possibility is to reorder a 4 X 4 block into a 2 X 8 block. Two 
blocks must therefore be reordered at the same time. For reasons of 
C 


efficiency, four blocks, st 5» are reordered as shown in Table I. 


Figures 2 and 3 show the ordered picture of Karen with 4 x 4 and 2 
xX 8 reordering. It. can be seen that these reordered pictures can be 
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A'C 
B1D 


Table I—Reordering of four blocks 





run-length coded efficiently because there are many long runs. Also 
note that the reordered pictures are visually degraded versions of the 
original dithered pictures. The 4 X 4 reordering gives smaller degra- 
dation than the 2 X 8 ordering. 


3.2 Pel reordering for two-dimensional coding 


Pel reordering for two-dimensional coding should increase the ver- 
tical correlation between runs, i.e., the reordered picture should have 
mostly vertical black and white strips. This is accomplished within 
each block by moving the pels most likely to be white to the left side 
and the pels most likely to be black to the right side and reversing this 
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Fig. 2—Karen after 4 X 4 reordering for run-length coding. 
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Fig. 3—Karen after 2 x 8 reordering for run-length coding. 


process for the next block. The reordering of two blocks, A‘ B, would 
be 


Block A Block B 
0 4 8 12 15 ll 7 38 
1 5 9 18 14 10 6 2 (4) 
2 6 10 14 13 9 5 1 


3 7% 11 15 12 8 4 QO 


Further improvements are possible, but the coding efficiency with 4 
x 4 reordering is low compared to the other reordering schemes. 
Improvement in the coding efficiency is obtained by 8 X 2 reordering. 
This type of reordering slightly reduces the vertical redundancy be- 
tween the runs, but that is largely compensated by the reduction in 
the number of runs. To exploit the maximum amount of correlation, 
two groups of four blocks of 4 X 4 pels must be considered. These 
blocks, named A, B, C, D, A’, B’, C’, and D’, are shown below: 


(5) 
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Table Il—Reordered configuration of eq. (5) 





Table II shows the reordered configuration. 

Another type of reordering is the 16 X 1 reordering. Now the pels 
are arranged in order of increasing or decreasing threshold level. Eight 
4 x 4 blocks shown below, 


A |B|c |p) #| F/G |x). 
are transformed into: 


1... 14 15 15 14... 
Bol... 14 15 15 14... 
C 01... 14 15 15 14... 
DO1... 14 15 15 14... 


(6) 


a 
OO O'S 
mo Sy & 


A slight improvement can be obtained by reversing every second group 
of four lines. Figures 4 and 5 show the picture “Karen” transformed by 
8 x 2 and 16 X 1 reordering, respectively. 


3.3 Implementation considerations 


All the above reordering schemes are simple to implement. Except 
in the case of 2 X 8 reordering for one-dimensional coding, only four 
lines with a maximum of 16 pels per line are processed at a time. The 
basic operation is to address the bits in a different sequence. The 
reordering should add very little to the cost of the coder and decoder, 
certainly less than the dithering operation which itself is quite simple. 


IV. COMPARISON OF PERFORMANCES 


The coding length in bits per pel have been measured for the various 
reordering schemes and for the bit interleaving schemes of Judice.® 
The three test pictures of Fig. 1 were used. Only the one-dimensional 
and the two-dimensional codes recommended by the CCITT for stand- 
ardization' are considered. The end of line codeword has been sup- 
pressed in the comparisons. In the two-dimensional case, all lines, 
except the first one, are coded with the two-dimensional code. Note 
that all the reordering schemes are two-dimensional, even when a one- 
dimensional code is used afterwards. 
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Fig. 4—Karen after 8 x 2 reordering for two-dimensional coding. 
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Table IIl—Coding length in bits per pel of the three test pictures when they are reordered and coded with the one- 
dimensional (1-D) and two-dimensional (2-D) codes proposed by the CCITT for standardization 
Engineering 
Karen Drawing House 
1-D Code 2-D Code 1-D Code 2-D Code 1-D Code 2-D Code 
1-D bit interleaving 0.433 0.607 0.365 
2-D bit interleaving 0.390 0.328 0.592 0.609 0.327 0.291 
4 X 4 reordering for run-length coding 0.395 0.661 0.383 
2 < 8 reordering for run-length coding 0.381 0.633 0.372 
4 X 4 reordering for 2-D coding 0.514 0.701 0.475 
8 xX 2 reordering for 2-D coding 0.321 0.642 0.309 
16 X 1 reordering for 2-D coding 0.306 0.621 0.301 
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Fig. Ga—Karen after 4 x 4 reordering for run-length coding. 


a «z 


eelZaltantatJ 
pe ee he head 
ana 
laltalsatlettal 


LS waste chatty it 
PA THUINETR 


a 


oT malalde! 


om he 


sot pegeege 





: 


ing. 


for run-length cod 


ing 


drawing after 4 x 4 reorderi 


Incermsg 


Fig. 6b—Eng 


Hit 
: 
; 


mdal 
| 
{ 


i 
| 


[fe 
| 
| 
| 


nt 
jil 
rel 


| 


i 


Ih 


Hlagegeaqay 


: 
! 
: 


, 


| 


~ 
| 


! 

















| 
| 








Fig. 6¢-—House after 4 < 4 reordering for run-length coding. 


Table III shows the coding lengths. In the case of the one-dimen- 
sional CCITT code, the compressions with the 4 xX 4 and 2 X 8 
reordering are slightly lower than with the two-dimensional bit-inter- 
leaving scheme of Judice. Interest in the 2 X 8 reordering is small 
because, compared to the simpler 4 < 4 reordering, the decrease in 
coding length is less than 5 percent. 

With the two-dimensional CCITT code, the two-dimensional bit 
interleaving and the 16 X 1 reordering lead to similar coding lengths. 
The reordering scheme seems preferable since it is simpler. The two- 
dimensional bit-interleaving scheme requires either the storage of the 
entire picture or several scans of the picture. The storage can be 
reduced to four lines without sacrificing the coding performance, but 
this requires modifying the CCITT code so that the fourth previous 
line is used as a reference line rather than the previous line. 

One advantage of the 4 < 4 reordering for run-length coding is that 
the reordered picture is similar to the original dithered picture. Figure 
6 shows the reordered versions of the three test pictures. Except for 
the “Engineering drawing,” the other two pictures have reasonable 
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Table 1!V—Entropies in bits per pel for the three test pictures when they are reordered 


-D bit interleaving 

-D bit interleaving 

x 4 reordering for run-length coding 
x 8 reordering for run-length coding 
x 4 reordering for 2-D coding 

x 2 reordering for 2-D coding 


6 X 1 reordering for 2-D coding 


Engineering 


Karen Drawing House 
Run- Run- Run- 
length 2-D length 2-D length 2-D 
Entropy Entropy Entropy Entropy Entropy Entropy 

0.313 0.429 0.276 

0.293 0.272 0.420 0.478 0.256 0.245 

0.302 0.468 0.293 

0.266 0.433 0.262 
0.436 0.608 0.388 
0.271 0.526 0.265 
0.247 0.500 0.243 


quality. Thus, a facsimile receiver without the dither postprocessor is 
able to reproduce a recognizable approximation of the dithered picture. 

Table IV gives entropy measurements for comparison purposes. The 
two-dimensional run-length entropy is obtained from the distribution 
of the symbols used in the two-dimensional CCITT code. The entropies 
are about 15 to 20 percent lower than the corresponding coding lengths 
of Table III. Improved codes leading to coding lengths only 10 percent 
higher than the entropy can be devised, but are outside the scope of 
this study. Entropy comparisons with Ref. 8 show that higher compres- 
sion can be obtained by more sophisticated techniques. 


V. CONCLUSION 


We have shown that dithered pictures can be transmitted efficiently 
with facsimile apparatus that uses either the one-dimensional or the 
two-dimensional codes proposed by the CCITT for standardization. 
The addition of a simple reversible preprocessor at the transmitter and 
a postprocessor at the receiver 1s required. The number of bits trans- 
mitted is 1.5 to 3.5 times lower than without coding. This compares 
favorably with compression ratios obtained by other schemes. The 
preprocessor consists of local reordering of pels. One of the pel reor- 
dering schemes has the advantage that a recognizable version of the 
original dithered picture is obtained with facsimile receivers not con- 
taining the postprocessor. This study has been made using only three 
small dithered pictures and a 4 X 4 dither matrix. Additional studies 
using a larger set of pictures and a larger dither matrix are necessary 
before implementation. 
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A 200-Hz to 30-MHz Computer-Operated 
Impedance/Admittance Bridge (COZY) 


By L. D. WHITE, R. W. COONS, and R. C. STRUM 
(Manuscript received September 12, 1980) 


For the past few years the development of ferromagnetic compo- 
nents, particularly for long-haul transmission systems, has relied 
heavily on large numbers of highly accurate impedance measure- 
ments made on a computer-operated impedance/admittance bridge 
(cozy) developed especially for this work. cozy ’s accuracy and speed 
enable a level of component development not otherwise possible. cozy 
automatically measures complex impedance, temperature coefficients 
of complex impedance, and disaccommodation factors of ferromag- 
netic materials, providing accuracies of +0.05 percent for inductance, 
+50 microradian for loss angle, and +10 parts per million for the 
small impedance changes associated with determinations of temper- 
ature coefficients and disaccommodation factors. COZY 1s easy to use 
and makes a measurement in 10 to 20 seconds. Also, the calibration 
of the bridge unit’s capacitance and conductance standards can be 
checked automatically. Though developed primarily for ferromag- 
netic component work, cozy is a general-purpose bridge; it measures 
inductance, capacitance, resistance, and conductance over wide 
impedance ranges at frequencies between 200 Hz and 30 MHz. This 
paper describes cozy’s hardware, software, and performance. 


l. INTRODUCTION 


A computer-operated impedance/admittance bridge (cozy) has been 
developed to have the following features: 

® wide frequency range—200 Hz to 30 MHz in 0.01-Hz steps 

® wide impedance/admittance range—from a resolution of 0.1 na- 
nohenry for small impedances to a resolution of.0.001 picofarad 
for small admittances 

® high accuracy—high-@ unknowns can be measured to +0.05 per- 
cent for inductance/capacitance and +50 microradians for loss 
angle. Changes in impedance/admittance (with temperature, 
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time, shock, and vibration, etc.) can be measured to +10 parts per 
million. 

® specifiable signal level—voltage or current may be specified over 
the nominal range of 0.05 to 5 volts for impedances larger than 
100 ohms and 0.5 to 50 milliamperes for smaller impedances. The 
achieved level is within +10 percent of the requested level and is 
measured to +3 percent. 

® relatively fast—20 seconds per measurement 

@ easy to use yet flexible—the bridge has only a single pair of 
binding posts to which the unknown is connected and the user 
needs to specify only a test frequency. However, the user can 
specify signal levels, frequency runs, and various options for post- 
processing of the measurement results. 

® the options for runs and postprocessing can be changed easily— 
the software clearly separates the options from the basic mea- 
surement process. 

® automatic aids for maintaining high accuracy—in particular, the 
calibration of the bridge unit’s standards can be automatically 
checked. 

A microcomputer-controlled environmental chamber with an 18- 
sample capacity is appliqued to cozy. The combined system provides 
the following additional features: 

® Highly accurate automatic measurements of the changes of the © 
samples’ impedances/admittances with environmental condi- 
tions—temperatures may be specified to +0.1° Celsius between 
—40° and 93° Celsius and relative humidities to +2.5 percent 
between 20 and 95 percent, with a minimum dew point of 2.5° 
Celsius. Soak times, signal levels, multiple frequencies, and envi- 
ronmental runs can also be specified. Average time for a single 
measurement is 10 seconds. 

@ Automatic meaurements of the disaccommodation factors (the 
decrease in permeability with time after demagnetization) of 
ferromagnetic materials—peak demagnetization currents up to 2 
amperes, with a 10-volt maximum, can be specified. 

COZY was developed to provide the measurements required in fer- 
romagnetic component development work. Large numbers of highly 
accurate measurements are required to evaluate materials, structures, 
and whole components over their operating ranges of frequency, signal 
level, and environmental conditions and to determine the effects of 
aging, shock, and vibration. 

The most crucial of the measurement requirements that led to the 
development of cozy were: (1) a basic precision of significantly better 
than ten parts per million to achieve the desired accuracies in measur- 
ing high @-values and small changes in impedance, and (2) a measure- 
ment time much less than a minute to provide the desired quantity of 
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measurements. The measuring systems that come closest to meeting 
these requirements are specially developed manual bridges’ and the 
50-Hz to 250-MHz computer-operated transmission measuring sys- 
tem.” The manual bridges have satisfactory precision but require many 
minutes and much care and expertise for a measurement. The com- 
puter-operated transmission measuring system, on the other hand, is 
amply fast but has a basic precision of approximately 100 parts per 
million. 

To meet the objectives for precision requires bridge techniques; pure 
transmission measurements are not satisfactory. To achieve short 
measurement times requires automation, and because the amount and 
complexity of bridge computations are large, the automation has to be 
done with a computer. 

The development of cozy required new design features and mea- 
surement procedures. Ccozy’s bridge differs markedly from manual 
bridges in two ways. First, small impedances are measured with novel 
bridge configurations based on techniques previously used to calibrate 
inductance standards.’ Second, all switching of the bridge configura- 
tions and setting of the standards is done by a new design of mercury- 
wetted contact relay that requires very different design considerations 
than the wafer switches used in manual bridges. 

COZY’s measurement process differs significantly from measurement 
processes in manual bridges in three basic areas: selecting the bridge 
configuration, balancing the bridge, and obtaining the last 1% decades 
of the balance. Tio determine the bridge configuration for a measure- 
ment, cozy calculates an approximate value for the unknown from 
four transmission-type measurements, three of which use predeter- 
mined settings of the bridge to provide a calibration of the system. 
Balancing is done with an iterative process in which capacitance and 
conductance standards are changed, the ratio of the change in the 
admittance of the standards to the change in the bridge output is 
calculated, and the next change to be made in the standards is 
calculated by multiplying this ratio by the last bridge output. The final 
1% decades of the balance are determined by measuring the bridge 
output over a one-second period. If the final degree of balance had 
been limited by noise, this measurement provides increased resolution 
by noise averaging. On the other hand, if the degree of balance had 
been limited by the finite size of the smallest steps of the standards, 
the measurement provides interpolation between these steps. 

To provide information suitable for use in the measurement process, 
the receiver must be phase sensitive and linear right down to zero 
signal. This is accomplished by using heterodyne techniques to produce 
two dc signals whose amplitudes, including signs, represent orthogonal 
components of the bridge’s output signal.* The accuracy of the repre- 
sentation is one percent. 
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To achieve the required control of temperature and humidity, the 
environmental chamber’s heaters, compressor, and humidifer water 
were put under the control of a microcomputer using specially devel- 
oped firmware. A microcomputer rather than cozy’s computer was 
used to enable cozy’s computer to be free for general purpose mea- 
surements while the specified environmental conditions and soak times 
are being achieved. 

This paper describes the hardware, software, and performance of 
the computer-operated bridge and of the facilities added to the bridge 
to provide temperature coefficient and disaccommodation factor mea- 
surements. Section II describes the bridge unit and other basic hard- 
ware. The calibration of the bridge is covered in Section III. Section 
IV describes the software, including the measurement process, inter- 
action with the user, and postprocessing of the measurement results. 
Section V gives the measurement accuracy and discusses the sources 
of measurement uncertainty. Section VI concerns the automatic aids 
for maintaining the accuracy and hardware. The main features of the 
hardware and software for automatically measuring the disaccommo- 
dation factors of ferromagnetic materials and the effects of tempera- 
ture and humidity on impedance are described in Section VII. Section 
VIII is a summary. | 


ll. BRIDGE UNIT AND OTHER BASIC HARDWARE 
2.1. General 


Figure 1 shows the basic hardware blocks for making impedance 
measurements: a signal generator, bridge unit, voltmeter connected to 
the bridge unit, receiver, and analog-to-digital converter, all controlled 
by a computer through an interface and test panel. Figure 2 is a 
photograph of cozy when put into service. The two and one-half bay 
cabinet at the left contains from left to right the signal generator, 
receiver, and bridge unit. Mounted on the horizontal top surface of the 
half-bay are the bridge’s two binding posts to which the unknown is 
connected. The six-bay cabinet on the right contains the computer 
and, at the far end, the interface and test panel. In the middle is a 
teletypewriter. In the background at the end of the six-bay cabinet is 
a “step-up” unit that provides computer controlled admittance ballast 
for automatic calibrations of the bridge’s admittance standards. 


2.2 Bridge unit 
2.2.1 Overall 


The bridge is a unity ratio type with 100-ohm resistors forming the 
ratio arms. Four configurations of the other two arms, accomplished 
by automatic switching, are used to measure the full-admittance range. 
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Fig. 1—Block diagram of computer-operated impedance/admittance bridge (cozy). 


Figure 3 shows simplified schematics of these configurations. The 
capacitance standard, C,, is in the A-D bridge arm for all configura- 
tions; the unknown, UNK, may be in either the A-D or the C-D arm; 
and the conductance standard, G;, is always in the arm adjacent to the 





Fig. 2—Computer-operated impedance/admittance bridge (cozy). 
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SMALL — IMPEDANCE CONFIGURATIONS 
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Fig. 3—Bridge configurations. 


unknown. The “small-admittance” configuration shown in Fig. 8a is 
used for capacitive unknowns having susceptances typically smaller 
than 0.02 siemens and conductances smaller than 0.009 siemens. The 
Fig. 3b configuration is used for similar-sized inductive unknowns. The 
“small-impedance” configurations shown in Figs. 3c and 3d are used 
for capacitive and inductive unknowns, respectively, having suscept- 
ances typically larger than 0.02 siemens and/or conductances larger 
than 0.009 siemens. In these small-impedance configurations measure- 
ments are made with one of eleven calibrated capacitors, Cx, in series 
with the unknown and a similar-sized ballast capacitor, Cxz, in the 
adjacent arm. 

As shown in Fig. 3a, signal is applied to the bridge by a transformer 
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connected between the A and C corners; the receiver is connected 
between the B and D corners; the voltmeter is connected across the C- 
D arm; and the D corner is grounded. 

Kach measurement requires manipulating the capacitance and 
conqductance standards to balance the bridge twice—an unknown 
balance with the unknown connected into a bridge arm and a reference 
balance with the unknown effectively out of the arm. The unknown’s 
admittance is computed from the admittance difference between the 
two balances. For the small-admittance configurations, shown in Figs. 
3a and 3b, the unknown balance is made with the switch in series with 
the unknown closed and the switch shunting the unknown open (as 
shown). The reference balance is made with the series switch open 
and the shuntng switch closed. For the small-impedance configura- 
tions, shown in Figs. 3c and 3d, the shunting switch is open for the 
unknown balance and closed for the reference balance. 

The bridge’s basic blocks and switches are shown in Fig. 4. Two 
transformers are required to cover the frequency range. One is used 
from 200 Hz to 101 kHz and the other, from 101 kHz to 30 MHz. Both 
are double-shielded and specially developed for bridge use. The trans- 
formers’ intershield capacitances, 90 pF and 15 pF, are large enough to 
require the complete disconnection of the unused transformer. 

The ratio arm resistors, R, and Re, are 0.01-percent metal film 
resistors having very small parasitic impedances so that their resist- 
ances are frequency independent well beyond the requirements of this 
bridge. The time constant of the ratio was adjusted to be within 10 ps 
of zero. 

The capacitance standard consists of eight decades covering 1.1 uF 
in 0.01-pF steps. The 1-, 0.1-, and 0.01-pF per step decades are wired 
into the C-D bridge arm. Since the capacitance standard is treated as 
being in the A-D arm, these decades are operated in reverse; that is, 
their “zero” settings are their maximum capacitance settings. 

The conductance standard covers 11,000 uS in 0.001-uS steps and 
consists of nine decades; low frequency and high frequency versions of 
the 1- and 10-uS per step decades are necessary to cover the frequency 
range. The 1000-, and 100-uS and the low frequency 10- and 1-uS per 
step decades can be switched into either the A-D or C-D arm. The 
remaining conductance decades are wired into either the A-D or C-D 
arm. 

The distribution of decades among the A-D arm, the C-D arm, and 
being switched was based on considerations of simultaneously mini- 
mizing the total admittance in the arms, the number of leads at a 
bridge corner, and the frequency dependencies of the decades. 

The switched admittances Y, through Ys; compensate for the 
changes in admittances in the A-D and C-D arms that accompany 
switching the conductance decades from one arm to the other. For 
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Fig. 4—Schematic diagram of bridge unit. 
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example, Y; compensates for the capacitance changes associated with 
switching the 1000- and 100-uS decades. When the decades are in the 
A-D arm, Y; is in the C-D arm, and vice versa. 

The calibrated series capacitor, Cx, and the ballast capacitor, Cxz, 
used in the small-impedance configurations, may be set to: 10, 30, 100, 
300, 1000, 3000 pF and 0.01, 0.03, 0.1, 0.3, and 1 uF. The series capacitor, 
Cx, is contained within a shield and the shield and capacitor can be 
connected to the A, C, or D corner. The ballast capacitors were 
adjusted during prove-in so that the bridge balances with low settings 
on the capacitance and conductance standards when the switch, S1P, 
across the binding posts is closed. 

The bridge was mechanically designed with the following objectives 
in mind: as low as possible impedances in series with the various 
components; the components in one arm shielded from the components 
in the other arms; and the basic blocks in each arm to have independent 
leads to the junction points with the adjacent arms and the associated 
generator or receiver connection, thus well defining each bridge corner. 
The ratio resistors and the A, B, C, and D corner blocks are contained 
in a central rigid structure. The capacitance and conductance decades 
are connected to the A or C block and the D block with coaxial cables. 

All the circuit components were selected for stability with time, 
temperature, humidity, and vibration. In addition, the bridge temper- 
ature is held constant to better than +0.05° Celsius. 

A critical circuit component, developed specially for this bridge, is 
the relay used for all switching. The relay contains a mercury-wetted 
contact switch, shown in Fig. 5, with the leads to the normally open 
contacts made of magnetic alloy and the leads to the normally closed 
contacts made of platinum. With no power applied to the relay the 
two platinum leads are shorted by a bar carried by the armature. 
When power is applied, this bar moves to short the two magnetic alloy 
leads. The magnetic alloy leads provide part of the magnetic circuit 
that permits the relay to be operated without external magnets, which 
would be too bulky. The platinum leads give a stable low-impedance 
path for the critical circuits; the alloy leads’ resistance changes so 
much with time after a relay is switched, due to temperature changes 
caused by local heating, that these leads can be used only in very high 
impedance or noncritical circuits. A specially designed shield and coil 
assembly electrostatically shields the capsule from its driving coil and 
surrounds the whole assembly with an electromagnetic shield that 
completes the magnetic circuit. 

The switch 1s very stable and reproducible. Measurements with 
manual bridges showed that the switches reset with variations of less 
than ten micro-ohms in series resistance and one thousandth picofarad 
in shunt capacitance. 
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Fig. 5—Special mercury-wetted contact switch used in the bridge unit’s relays. 


However, the switch has some disadvantages. It must be used 

upright; it is relatively large with a horizontal center-to-center spacing 

of three-quarters of an inch; and it has 5- pF capacitance from the 
armature to the grounded shield. 

Figure 6 is a photograph looking down on the bridge. So that the 
details show, the top cover and the cover of the shield surrounding the 
ratio resistors have been removed. The junction of the detector lead 
with this shield is the B corner. The A and C corners are below the 
high binding post, H, which is shown. The D corner is below the A and 
C corners. The low binding post, L, is mounted on the top cover and 
in use is located above the shield around the ratio resistors. 
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2.2.2 Design of admittance standards and series capacitor 


A fundamental objective in designing the capacitance and conduct- 
ance standards was to achieve wide frequency performance. The 
problem is that the admittances of the steps of the decades typically 
increase with increasing frequency and the percentage increases are 
larger for the larger steps. As a result, at high frequencies gaps occur 
in the admittances that can be provided by the standards. 

Accommodation for some increases in the admittances of the dec- 
ades’ steps is achieved by using decades.that employ 11 settings: 0, 1, 
2, 3, 4, 5, 6, 7, 8, 9, and 10. The ten-settings provide overlaps at low 
frequencies in the admittance ranges obtained with successive settings 
of any decade except the least significant one. For example, at low 
frequencies the maximum capacitance that can be obtained with the 
one-setting of the 100-pF per step decade being the most significant 
setting is 211.10 pF. This capacitance is obtained by setting the 100-pF 
through 0.01-pF per step decades to 1-10-10.-10-10, respectively. On 
the other hand, the minimum capacitance obtainable with the two- 
setting of the 100-pF decade is 200 pF, achieved with decade settings 
of 2-0-0.-0-0. Thus, there is an 11.1-pF overlap in the capacitance 





Fig. 6—Top view of bridge unit. 
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Fig. 7—Schematic diagram of low-frequency 10-microsiemens per step decade. 


ranges provided by the one- and two-settings of the 100-pF decade. 
Consequently, the capacitance of the two-setting can increase to being 
111.1 pF larger than that of the one-setting before a gap develops 
between their associated capacitance ranges. 

Two general types of decades are used in the cozy bridge: an 
“adding” type consisting of four units that singly and in combination 
form a complete decade, and a “residual” type consisting of 10 or 11 
capacitors (or resistors) switched singly into series with a common 
much smaller capacitor (or larger resistor). The 100,000-pF, 10,000-pF, 
1000-pF, 100-pF, 10-pF, 1000-uS, 100-uS, and the low frequency 10- 
and 1-uS decades are adding type. The rest of the decades are residual 
type. 

The most complex of the adding type decades is the low-frequency 
10-S per step decade, which is used up to 15 MHz. Figure 7 shows the 
decade. The individual units are 10, 20, 30, and 40 uwS. Capacitors are 
used to compensate for the 5-pF capacitances between the switch 
armatures and ground, thereby making the decade’s capacitance in- 
dependent of switch settings. The resistors in series with the capacitors 
are relatively small and were selected to yield conductances that 
partially compensate for the increases with frequency of the conduct- 
ances of the unit resistors as a result of their distributed capacitances. 
Thus, the admittance differences between the decade’s settings are 
almost pure conductance and as independent of frequency as 1s prac- 
tical. 

However, at frequencies above 15 MHz a residual type decade is 
needed to achieve 10-uS steps. The decade contains 11 resistors, 
ranging from 100 to 404.76 ohms, that can be switched singly into 
series with a 1500-ohm resistor. Across each resistor is a capacitor 
adjusted to achieve the same time constant for each resistor and 
thereby to make the conductance differences between the settings 
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independent of frequency. This decade is not used at low frequencies 
because it adds 525 pS to the bridge’s residual admittance. 

As with the bridge standards, a design objective for the series 
capacitors, Cx, was to minimize the increases in their effective capac- 
itances with increasing frequency. However, because the switches have 
5-pF capacitance from armature to ground and 0.7 pF across an open 
switch, a more difficult objective was to achieve at the smaller settings 
satisfactorily small values for the capacitances in series with and 
shunting the binding posts. If these capacitances are too large, gaps 
will exist in the bridge’s impedance coverage at the top frequencies. 
For example, if the minimum values for these capacitances were 12 
pF, then a 1-uH unknown could not be measured at 30 MHz—it could 
not be measured with a small admittance configuration because its 36- 
pF resonating capacitance exceeds the capacitance standard’s 30-pF 
range and it could not be measured with a small-impedance configu- 
ration because the difference between the unknown and reference 
balances would be 37 pF. 

Figure 8 shows the design of the series capacitor, Cx. The minimum 
setting is with switch S1 open and switch S2 closed. Switch S2 is closed 
to put slightly more of the parasitic capacitance across the bridge arm 
where it is harmless. For this setting, the series capacitance is com- 
posed entirely of the capacitance of the wiring to the shield and 
amounts to 10 pF. The capacitance shunting the unknown Is also about 
10 pF. These capacitances are small enough to provide continuous 
impedance coverage between the small-admittance and small-imped- 
ance configurations. The 30-pF setting of the series capacitor is ob- 
tained by closing switch S1 only, which adds a 20-pF capacitor. The 5- 
pF armature-to-ground capacitance of switch $1 increases the total 
capacitance shunting the unknown to 15 pF. The 100-pF setting is 
obtained by closing switches S1, S2, and S4. However, no additonal 
capacitance is thrown across the unknown since the armature-to- 
ground capacitance of switch $2 is across the bridge arm. Switch S4 is 
closed only to concentrate the remaining parasitic capacitances across 
the whole bridge arm. This type of switching is continued through the 
higher values of Cx. 


2.3 Other basic hardware 


The generator contains a frequency synthesizer as the signal source 
and can supply between 1 millivolt and 10 volts in 0.05-dB steps to a 
75-ohm load. The criteria used in selecting the synthesizer included 
low phase noise and low harmonics, and in selecting amplifiers, high 
linearity. These criteria are important because many bridge balances 
are narrow band. In these balances phase noise contributes to the 
noise at balance and so must be kept low. Also, harmonics are not 
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Fig. 8—Schematic diagram of the series capacitor, Cr. 
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strongly attenuated and thus care must be taken to prevent them from 
intermodulating in the receiver to produce a false fundamental. Har- 
monics in the generator’s output are more than 30 dB down from the 
fundamental. 

The receiver’s minimum detectable signal is no larger than 0.05 
microvolts (with a one-second measurement time) and the receiver is 
linear for input signals up to at least 5 volts. Maximum voltage gain is 
one million and at full gain intermodulation of 0.16-volt second and 
third harmonics (corresponding to 30 dB down from a 5-volt funda- 
mental) produces less than two microvolts of fundamental referred to 
the input. Crosstalk corresponds to less than 0.05 microvolts of fun- 
damental at the input. Settling times and overload recovery times are 
no more than 1 ms except for input frequencies below 4.9 kHz, where 
the settling time is 17 ms. 

Between 120 kHz and 30 MHz, two stages of frequency translation 
are used. The first stage uses a synthesizer as a local oscillator and 
produces an IF signal at 28 kHz. The second stage uses two ring 
modulators in parallel to translate the IF signal to two dc voltages. 
The local oscillator sources for the modulators are two 28-kHz sine 
waves, with a 90 degree phase difference between them, generated 
from the same signal sources used to generate the IF signal. 

Between 200 Hz and 120 kHz, three stages of frequency translation 
are used: from 200 Hz to 4.9 kHz, the 28-kHz IF is preceded by a 97- 
kHz IF; and from 4.9 kHz to 120 kHz, by a 528-kHz IF. 

A low-pass filter preceding the first mixer attenuates harmonics of 
the test frequency so that they do not generate significant IF signals 
by intermodulation in the mixer. The receiver contains 29 of these low- 
pass filters and each filter is used over a frequency range of approxi- 
mately two-thirds of an octave, 1.e., the filters cover 200 to 315 Hz, 315 
to 480 Hz, etc. The filters below 45 kHz are active and have input 
impedances of 600 + 2 ohms. The filters above 45 kHz are passive and 
have 75-ohm input impedances and return losses larger than 15 dB. 

The receiver’s two dc outputs are read one after the other into the 
computer by the A/D converter. Noise averaging is done by taking 
successive pairs of readings with 1.4 ms between each pair. The A/D 
converter covers the range from —10 Vdc to +10 Vde with a 15-bit 
(including sign) output. At the converter’s input is a 48-channel mul- 
tiplexer. 

A specially developed voltmeter is connected across one arm of the 
bridge and puts out a dc voltage proportional to the rf voltage at its 
input. Four gain settings are used to cover the 0.05- to 5-volt range 
and the ratios of the dc to rf voltages are within two percent of the 
nominal values. One percent accuracy is achieved with simple com- 
puter corrections. Voltages below 0.05 volts can be measured with 
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some degradation in accuracy. The response time for one percent 
accuracy is 24 ms. An important feature of the voltmeter is that its 
input admittance is very stable and small; it is 7.2 pF and 0.4 uS at 100 
kHz. 

The test panel contains a light for observing and a switch for 
manually controlling the state of each computer-controlled relay. 
Manual control of the relays is valuable for prove-in and trouble 
diagnosis of the system. Observation of the lights during the measure- 
ment process yields information on whether the process is going well, 
and if not, where the source of trouble is. 

The computer originally included automatic priority interrupt, 24K 
words of core memory, a 256K-word fixed head disk and three magnetic 
tape drives. Since then the computer has been upgraded to include 
32K words of core memory, two 1% M-word cartridge disks, and 
floating-point hardware. 


lil. CALIBRATION OF THE BRIDGE UNIT 
3.1 General 


Calibration values are determined for the capacitance and conduct- 
ance of each step of the capacitance standard, the conductance stan- 
dard, and the series capacitors and of the various admittances that 
appear in the equivalent circuits used in data reduction. These deter- 
minations are made at the 18 frequencies of 0.2, 0.5, 1, 2 kHz, ---, 2, 5, 
10, 15, 20, and 30 mHz. 

The equivalent circuits for data reduction are given in Section 3.2 
and the basic calibration procedures are given in Sections 3.3 and 3.4. 
However, note that in some cases the actual calibration values assigned 
are based on more than a single method of determination. 


3.2 Equivalent circuits for data reduction 


Data reduction is based on simplified equivalent circuits relating an 
unknown’s admittance to the admittance difference between the un- 
known and reference balances. An objective in choosing the equivalent 
circuits was that their various admittances could be evaluated with 
relatively simple procedures and available external admittance stan- 
dards. 

Figure 9 shows the equivalent circuits used for the unknown and 
reference balances for the small-admittance configuration when mea- 
suring capacitive unknowns. Only three admittances per balance are 
necessary to characterize the circuitry connecting the unknown to the 
bridge corners. These equivalent admittances may, of course, be 
strongly frequency dependent. In the reference balance, the impedance 
of the short-circuiting switch across the binding posts is so small 
compared to the impedance in series with it and the impedance of the 
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unknown that the unknown’s impedance does not significantly affect 
the reference balance. Thus, the three equivalent admittances for the 
reference balance reduce to one. 

From Fig. 9, the admittance difference, Yu — Yr, between the 
unknown and reference balances 1s given by 


Yx + Yup 
a = + Von — Yer, 1] 
Yu Yr 1+ Zcu ( Yx t+ Yup) i oo 


where Yx is the admittance of the unknown. As part of the calibration 


procedure, the admittance difference, Yur, between unknown and 
reference balances with nothing across the binding posts, is measured 


UNKNOWN 





UNKNOWN BALANCE 


(a) 


(Zuor Your)? < 107° 





REFERENCE BALANCE 


(b) 


Fig. 9—Equivalent circuits used for reducing measurement data taken with the small- 
admittance confipuration of Fig. 3a. 
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Fig. 10—Equivalent circuit used for reducing measurement data taken with the small- 
impedance configuration of Fig. 3d. 


and stored. This difference is given by 
Yup 


VS 
oer 1+ Zcn Yup 


+ Yop — Yer. (2) 
Substituting the expression for Ycp — Ycr, obtained from eq. (2) into 
eq. (1), yields 


Yx 
ee eS a ee + 
[1 + Zcu(Yx + Yup)]-(1 + Zcx Yun] 


This equation is the basis for data reduction. Values for Yyp and Zcy 
are determined and stored in files as part of the calibration process. 

For unknowns having small inductive admittances the values for the 
equivalent circuit admittances are for the A-D arm. 

Figure 10 shows the equivalent circuit for data reduction of inductive 
unknowns measured with the small-impedance configuration. The 
unknown balance is made with the short-circuiting switch, S1P, open; 
and the reference balance, with the switch closed. Yx is the effective 
admittance between the C corner and the F junction. Yar is the 
admittance across the open switch, S2P (see Fig. 4), to the A corner 
and is about 0.7 pF. The admittance difference, Yu — Yr, between the 
settings of the standards for the unknown and reference balances is 


—(Yx — Y’z) 
[1+ Zsw(Yx+ Yae + Yxre))(Yx+ Yar + Yxe)’ 


Yu — Yr= Yur. = (3) 


Yu- Yr= (4) 


where 
Yx t+ Yur 


Yxe = Yep + ———_———_-.. 
= me ee Zen(Yx + Yur) 


(5) 
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The equivalent circuit for measuring small impedance capacitive 
unknowns differs from Fig. 10 only in that the unknown is connected 
into the A-D arm. 


3.3 Calibration of the bridge standards and of the parasitic admittances 
for the small-admittance configurations 


Determining the calibrations for the capacitance and conductance 
standards involved four very different phases: (1) the individual steps 
of the standards were intercompared and consistent calibrations for 
the standards were obtained by a step-up calibration”®; (2) the mag- 
nitude and phase of the bridge ratio were determined; (3) the admit- 
tance, Yyp, shunting the binding posts in the small-admittance config- 
uration for measuring capacitors was evaluated; and (4) the impedance, 
Zcu, M series with the binding posts and absolute values for the 
capacitance and conductance standards were determined from mea- 
surements of external standards. 

In a step-up calibration, each incremental step of each decade is 
compared with the full range of the next smaller decade. For example, 
consider the calibration of the steps of the 10-pF per step decade. The 
bridge is set in the configuration for measuring small capacitance, Fig. 
4a, and external adjustable capacitance and conductance ballast is 
connected to the bridge’s binding posts. The capacitance standard’s 
10-pF through 0.01-pF decades are set to 0-10.-5-5; the conductance 
standard is set to 0-0-5-5.-5-5-5; and the bridge is balanced by adjusting 
the external ballast. Then the 10-pF decade is set to one, the 1-pF 
decade is set to zero, and the bridge is rebalanced using the 0.1-pF 
decade, the 0.01-pF decade, and the conductance standard. Assume 
the standards’ settings at this second balance are 1-0.-5-2 pF and 
0-0-5-5.-5-1-2 wS. From the bridge settings we would calculate that the 
first step of the 10-pF decade is 0.03 pF and 0.043 uS larger than the 
full 10-pF range of the 1-pF decade. Similarly, the size of the step 
between the one- and two-settings of the 10-pF decade is then com- 
pared with the full range of the 1-pF decade by making balances with 
these decades set first at 1-10 and then at 2-0. The steps of the 1-pF 
decade are in turn compared with the full range of the 0.1-pF decade 
and the steps of that decade, with the 0.01-pF decade. 

The 0.01-pF decade cannot be compared with a smaller decade— 
there is none. Instead, the individual incremental steps of the decade 
are intercompared via transmission-type measurements made with the 
receiver, and the results are processed to yield a calibration for the 
one- through ten-settings relative to the full range of the decade. 

After all the steps of all the decades have been intercompared, the 
data is processed to yield a consistent calibration. The processing 
starts at the smallest decades and proceeds up both the capacitance 
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and conductance decades. The calibration calculated for the conduct- 
ance decades is applied in reducing the data for the capacitance 
decades and vice versa. 

The step-up calibration for the capacitance standard differs from 
the absolute calibration in magnitude and phase angle, the same 
percentage difference and the same phase angle difference for each 
setting of each decade. Thus, the capacitance and conductance correc- 
tions to be added to the step-up calibrations for the individual steps 
are proportional to the capacitances of the steps. Similarly for the 
conductance standard, the differences between its step-up calibration 
and its absolute calibration are a percentage for the conductance of 
each step and a capacitance proportional to the step’s conductance for 
the capacitance of each step. 

The magnitude and phase angle of the bridge ratio were determined 
using the auxiliary binding posts symmetrically connected across the 
A-D and C-D bridge arms via mode switches S16P and S17P (see Fig. 
4). Bridge balances were made with: (1) nothing across the auxiliary 
binding posts; (2) a capacitance C4 connected to the A-D binding 
posts, and a similar sized capacitance Cc connected to the C-D binding 
posts; and (3) Cc connected to the A-D binding posts and C', to the C- 
D binding posts. The effects of any differences between the series 
impedances of the internal coaxial leads between the bridge corners 
and the auxiliary binding posts were calculated and taken into account 
by making similar balances using a different capacitance value for C, 
and Cc. 

The admittance, Yxp, shunting the binding posts in the small- 
admittance configuration was measured by using switches S3P and 
S2P to connect Yup first into the C-D arm and then into the A-D arm. 
Yup 1s one-half the difference between the two resulting balances, 
corrected for the effects of the impedances in series with Yyp and of 
the armature-to-ground capacitances in switches S2P and S3P. 

The impedance, Zcu, in series with the binding posts was determined 
and absolute calibrations for the bridge standards were obtained by 
measuring external standards. Two external capacitance standards 
were measured in terms of the step-up calibration. The data-reduction 
equations with known values for the unknowns and the admittance 
Yyp were then solved for the value for the series impedance Zcy and 
the percentage and phase-angle corrections to be made in the step-up 
calibration for the bridge’s capacitance standard. Then, an external 
conductance standard was measured to determine the percentage and 
phase angle corrections to be made in the step-up calibration of the 
bridge’s conductance standard. 

The external capacitance standards used at frequencies up to 100 
kHz were mica capacitors calibrated on a Type 12 capacitance bridge’ 
that has a basic accuracy of +50 ppm for capacitance and +25 micro- 
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radians for loss angle. At frequencies above 100 kHz, the external 
capacitance standards were parallel plate capacitors with air dielectric 
and geometries permitting their conductances and the frequency de- 
pendencies of their capacitances to be calculated. The low-frequency 
capacitances of these standards were obtained by measuring them on 
the bridge at 100 kHz using the calibration obtained with the external 
mica capacitance standards. The parallel plate capacitors were not 
disturbed between the 100-kHz measurements and the higher-fre- 
quency measurements. 

The external conductance standards were 100- and 1000-ohm metal 
film resistors whose conductances and shunt capacitances were deter- 
mined by various procedures using the Type 12 capacitance bridge and 
a Wheatstone bridge having a basic accuracy of +10 parts per million. 
The conductances and capacitances of these resistors are not signifi- 
cant functions of frequency. 

The impedance in series with the binding posts when they are 
connected into the A-D arm was evaluated by comparing measure- 
ments made in the A-D arm with measurements made in the C-D arm. 
Also, the admittance difference between the unknown and reference 
balances with nothing connected to the binding posts was measured 
with the binding posts in the A-D arm. 

Step-up calibrations of the standards, including the intercompari- 
sons of the steps and the data reduction, are done automatically at 
frequencies up to 5 MHz. For these calibrations the adjustable capac- 
itance and conductance ballast connected to the bridge’s binding posts 
is a specially developed “step-up unit’? under computer control. This 
unit contains capacitance and conductance decades similar to those in 
the bridge. The steps of the step-up unit’s decades are calibrated by 
automatically measuring them with the bridge. 

Above 5 MHz the performance of the step-up unit is unsatisfactory 
and special manually adjustable capacitance and conductance ballasts 
are used. The intercomparison process is semiautomatic, with the 
operator adjusting the ballast and the computer balancing the bridge, 
recording the results, and reducing the data. 

Measurements of the admittance differences between the unknown 
and reference balances with nothing connected to the bridge’s binding 
posts are also done automatically. 


3.4 Calibration of the series capacitors and parasitic admittances for the 
small-impedance configurations : 
The admittances/impedances in the equivalent circuit of Fig. 10 for 
the small-impedance configurations were evaluated in four phases. 
The direct capacitance, Cyr, from the £ junction to the A corner via 
the open switch S2P was measured at 10 kHz with the Type 12 
capacitance bridge. 
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The admittances, Yx and Yez, in series with and shunting the 
unknown were determined from various bridge balances. For example, 
at the setting for minimum series capacitance three balances were 
made: one with the two admittances in series and one each with the 
individual admittances connected across the C-D arm. Corrections for 
the shunt admittances and series impedances of the switches used to 
connect in the individual admittances enter into the determinations. 

The admittance, Yx,, across the binding posts was estimated to be 
2 pF. 

The switch impedance, Zsw, and the binding post impedance, Zen, 
were determined from measurements of the impedance differences 
between the individual impedances and the impedances in parallel. 
Three bridge balances were made using the small-impedance configu- 
ration and the largest series capacitor calibrated at the frequency of 
the determination. One balance was with the switch, S1P, closed and 
nothing across the binding posts. Another balance was with the switch 
open and the binding posts short-circuited by a metallic plate having 
effectively zero impedance. The third balance was with the switch 
closed and the binding posts short-circuited. 

The bridge construction is so symmetrical that the values for the 
circuit elements for the unknown in the A-D arm are the same as for 
the unknown in the C-D arm. 

The measurements and data reduction are done automatically for 
the series and shunt admittances, Yx and Ye7, and for the switch and 
binding post impedances, Zsw and Zen. 


IV. SOFTWARE AND MEASUREMENT PROCESS 
4.1 Software structure 


The structure of the software for making measurements embodies a 
clearly defined division of responsibility between the measurement 
function and the user interaction and post processing functions. This 
enables the measurement center personnel responsible for the day-to- 
day operation to respond to changing customer needs without getting 
involved in the measuring process itself. 

Three modules, labeled INPUT, OUTPUT, and CNTROL, provide for the 
user interaction and post processing. They were written by the mea- 
surement center. One module, MEASUR, provides the measurement 
process and was written by the developers of cozy. The INPUT module 
conducts the dialogue with the user. When the dialogue is completed, 
INPUT passes the collected data and program control to the CNTROL 
module. CNTROL oversees the making of measurements according to 
the user-supplied data. For each individual measurement, 1.e., a single 
unknown at a single frequency and a single signal level, CNTROL passes 
the measurement frequency and signal level to MEASUR. MEASUR then 
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makes the measurement. When all the specified measurements have 
been made, CNTROL passes the data and program control to the OUTPUT 
module. OUTPUT processes the measurement results according to the 
user requests specified to INPUT and presents the processed data in 
the desired formats. 

The INPUT, CNTROL, and OUTPUT modules are each separate over- 
lays. Part of MEASUR Is in the CNTROL overlay but the bulk of MEASUR 
is in two additional overlays. 


4.2 MEASUR— The measurement module 


The module MEASUR contains all the routines required to measure 
one unknown at one frequency and one signal level. The test frequency 
and the desired signal level are passed to MEASUR via dedicated 
locations in the COMMON storage area. Similarly, MEASUR fills other 
dedicated locations in COMMON with the measurement results. Included 
in these results are: the unknown’s admittance as seen at the bridge 
binding posts, the measured signal level applied to the unknown, and 
information specifying the bridge configuration used. 

Figure 11 shows a flowchart for MEASUR. MEASUR starts by tuning 
the generator and receiver for the specified test frequency and setting 
the signal level and the receiver gain to their minimum values. Then 
the calibration data for the capacitance standard, the conductance 
standard, the series capacitors, and the bridge’s parasitic admittances 
are obtained. For measurement frequencies that correspond to calibra- 
tion frequencies, the data is read from disk files. For measurement 
frequencies between the calibration frequencies, admittance values are 
obtained by interpolation between the values at the calibration fre- 
quencies next above and below the measurement frequency. The 
interpolation formulas are based on the physical causes for the fre- 
quency dependencies. The formulas were checked by comparing cali- 
bration values at a calibration frequency with values obtained by 
interpolation using the next lower and higher calibration frequencies. 

To efficiently select a bridge configuration and start the balancing 
process, an approximate value for the unknown’s admittance is deter- 
mined. Preparatory to determining the approximate value, the signal 
level is set so that during the determination the level applied to the 
unknown will not exceed that specified for the actual measurement. 
This level is set by effectively placing the voltmeter across the A-C 
bridge diagonal, i.e., across the secondary of the transformer, and 
adjusting the generator to the voltage requested by the user. For this 
preliminary level setting, a specified current is expressed as the cor- 
responding voltage across a 100-ohm resistance, which is approxi- 
mately the impedance in series with the unknown during the deter- 
mination. 
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Fig. 11—Flowchart for the software module, MEASUR, that makes a measurement at 
a single frequency and signal level. 
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The approximate admittance of the unknown is calculated from 
measurements of the bridge output for four separate conditions, three 
of which serve to calibrate the system. All four conditions use the 
conductance standard in the C-D arm set to 9900 uS, which is approx- 
imately the admittance level between the small-impedance and small- 
admittance configurations. The three conditions used to calibrate the 
system are: a “short,” an “open,” and 6280 uS of susceptance in the A- 
D arm. The admittance values used for the “short” and “open” are 
based on the parasitic admittances of the switches, S1P and S2P, used 
to achieve the “short” and “open.” The 6280-uS susceptance is ob- 
tained by an appropriate setting of the capacitance standard, and the 
actual susceptance and conductance of the setting are obtained from 
the standard’s calibration data. The fourth condition uses the unknown 
in the A-D arm. 

A small-admittance configuration is used if the conductance of the 
unknown is less than 90 percent of the 10,000-uS range of the con- 
ductance standard and if the magnitude of the unknown’s susceptance 
is within 0.02 siemens and 75 percent of the range of the capacitance 
standard. Otherwise a small-impedance configuration is used. 

When a small-impedance configuration is selected, the program 
chooses the largest series capacitor that is calibrated at the test 
frequency and that meets both of the following requirements: (1) the 
impedance of the capacitor and unknown in series must be greater 
than 20 ohms, and (2) the capacitance and conductance changes 
between the series capacitor alone and the series combination of the 
capacitor and the unknown must be within 75 percent of the range of 
the capacitance standard and 90 percent of the range of the conduct- 
ance standard. 

The arm into which the unknown is connected is determined by the 
sign of the unknown’s susceptance. 

After the bridge is set to the selected configuration and the bridge 
standards are set at values calculated to yield a bridge balance, the 
signal generator is adjusted to apply the requested level to the un- 
known. The level computations are based on the approximate value 
for the unknown and on the bridge configuration. 

The bridge is then balanced at the reference condition. In the 
balancing process, the bridge’s capacitance and conductance standards 
are iteratively adjusted toward balance, and the receiver gain is itera- 
tively increased to maintain the receiver’s output within the working 
range of the A/D converter. The changes to be made in the bridge 
standards are computed from the bridge’s output, which is approxi- 
mately proportional to the admittance difference between the A-D 
and C-D bridge arms. The constant of proportionality, in terms of 
siemens-per-volt, is computed after each change in the standards. This 
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computation has to include the relative gains of the receiver before 
and after changing the standards. At frequencies above 45 kHz, the 
impedances of the filters following the rf attenuators are so poorly 
known (15 dB return loss) that the insertion loss of the last rf atten- 
uator must be determined at the time of its being removed. This 
determination is made by adjusting the bridge unbalance to yield a 
suitably sized signal and reading the receiver’s output with the atten- 
uator in and then with it out. 

When the bridge is very nearly balanced, the bridge’s output signal 
is small and the resulting signal-to-noise ratio may be too small for 
accurate determinations of the siemens-per-volt. Accordingly, the sie- 
mens-per-volt is specifically calibrated for a final time when the bridge 
unbalance is within 0.5 percent of the total admittance across the C-D 
bridge arm. Between this degree of unbalance and complete balance, 
the siemens-per-volt will stay constant to within 0.25 percent. 

The iterative process used to converge to balance is terminated 
when one of three conditions has been met: (1) the unbalance is within 
the smallest steps of the standards plus an allowance for noise; (2) the 
unbalance is within the rms deviation due to noise; or (3) there have 
been 50 iterations (a trouble condition). 

The admittance of the remaining unbalance is calculated from a 
one-second reading of the receiver’s outputs and the final siemens-per- 
volt calibration. So receiver’s output signal is large enough to be read 
accurately and/or the output noise is well above the resolution of the 
A/D converter, the mean of the higher-valued receiver output is made 
greater than 5 volts or the standard deviation of the noisier receiver 
output is made greater than 0.2 volts. Then 700 pairs of samples of the 
outputs are taken over a one-second period. 

A check is made of the correctness of the choice of bridge arm if the 
approximate admittance of the unknown differs by a factor of 100 or 
more from the 9900-uS admittance level used in determining the 
approximate value. For such admittances the determination of the 
approximate susceptance is not made with sufficient accuracy to assure 
consistently correct arm selection. Similarly the choice of arm is 
checked if the Q-value is less than 0.1 because the determination of 
the unknown’s susceptance is not made accurately enough in the 
presence of relatively large conductance. 

The check of the arm selection is made by leaving the bridge 
standards set at the reference balance; setting the S1P, S2P, and S3P 
switches for the unknown balance; and reading the receiver’s output. 
If the susceptance change indicated by the output would require 
decreasing the setting of the capacitance standard to achieve balance, 
the unknown is switched into the other bridge arm and the setting of 
the signal level and the making of the reference balance are repeated. 
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When the arm selection is satisfactory after the one-second reading 
is complete, an unknown balance and one-second reading are made. 

Then a second check on arm selection is made. If the capacitance of 
the unknown balance and its one-second reading is less than that for 
the reference balance, the arm selection is changed and the measure- 
ment is repeated starting at setting the signal level. 


When the arm selection is proper, the voltage across the C-D arm of 
the bridge is read while the bridge is still set at the unknown balance. 
The admittance of the unknown and the signal level being applied to 
it are calculated. If the level is within 10 percent of the specified value, 
the measurement is complete and MEASUR returns. Otherwise, the 
signal level is reset and the measurement is repeated. However, this 
time the value of the unknown 1s known more accurately than the first 
time so the achieved signal level will be closer to the specified level. 


4.3 User interaction and postprocessing 


The simplest use of cozy consists of connecting the unknown to the 
bridge and entering a test frequency via teletypewriter dialogue with 
the program. After approximately 20 seconds, the measured value, the 
measurement uncertainty, and the signal level are typed out. In 
measurements where the signal level is not specified, as in this example, 
four volts are applied to the bridge. 

Up to 15 frequencies and a signal level may be specified in a run. 
The signal level may be specified in terms of voltage across the 
unknown or current through it. Signal levels below the nominal minima 
of 50 millivolts or 0.5 milliamperes can be specified, but the measure- 
ment precision may be degraded by receiver noise. Above 15 MHz, the 
maximum signal levels decrease from 5 volts or 50 milliamperes to 0.4 
volts or 4 milliamperes at 30 MHz. 

The postprocessing options include having the results expressed in 
terms of a parallel model (e.g., parallel capacitance and conductance), 
a series model, and magnitude and angle. Also, Q-values may be 
requested. In addition, an inductor can be measured at several fre- 
quencies, and the change in effective inductance with frequency can 
be represented as a capacitance across the inductor. 

When a cable or fixture is used to attach a component to the bridge, 
one or two measurements (at each frequency in the case of a frequency 
run) may be used to characterize the cable. Then, after the component 
has been measured, the effects of the connecting cable or fixture are 
automatically corrected for. The characterization measurements are 
of the open-circuit admittance and/or the short-circuit impedance of 
the cable or fixture. The circuit model for the cable or fixture is that 
of a uniform transmission line. 
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The teletypewriter, a line printer and/or magnetic tape may be 
specified as the output medium. 


V. ACCURACY 
5.1 Accuracy for measuring the impedance/admittance of components 


Figure 12 shows on a reactance chart contours of +0.05-, +0.25- and 
+1-percent uncertainties in the measurements of the inductance or 
capacitance of components having large Q-values. Figure 13 shows 
contours for +50- and +250-microradian uncertainties in measuring 
the loss angles of such components. These loss angle uncertainties 
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Fig. 12—Inductance/capacitance measurement uncertainty for high-Q components. 
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Fig. 13—Loss angle measurement uncertainty for high-Q components. 


correspond to +5 percent uncertainty in measuring Q-values of 1000 
and 200. Figure 14 shows +0.05-, +0.1-, and +1-percent uncertainty 
contours for measuring the resistance or conductance of components 
having small Q-values. Figure 15 shows contours of +100-, +1000-, and 
+10,000-microradian uncertainties in measuring the phase angle of 
these components. For clarity all four figures show smoothed contours 
for the small-impedance configurations. The exact contours would 
have small sawtooth wiggles as successively smaller series capacitors 
were used with increasing frequency. Also small differences between 
the uncertainties for measuring inductors and capacitors have been 
ignored. 
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The confidence factor for the uncertainty contours is 75 percent. 
That is, at a contour there is a 75 percent probability that the error in 
a measurement is less than the uncertainty associated with the con- 
tour. Well within a contour the confidence factor is much higher than 
75 percent. For example, well within the +0.05 percent contour of Fig. 
12 the uncertainty approaches +0.02 percent and the probability that 
the error is within 0.05 percent approaches unity. 

The uncertainties shown in the figures are for voltages across the C- 
D bridge arm between 0.05 V and 5 V. For small-admittance compo- 
nents, these voltages correspond to test voltages across the component 
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Fig. 14—Resistance/conductance measurement uncertainty for low-Q components. 
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Fig. 15—Phase angle measurement uncertainty for low-@ components. 


between 0.05 V and 5 V; for small-impedance components, to test 
currents through the components of approximately 0.5 mA to 50 mA. 

For components measured with the small-admittance configura- 
tions, the basic measured quantity is the admittance difference be- 
tween a measurement of the component plus connecting leads and a 
measurement of the open-circuited leads. This admittance difference 
can be translated from cozy’s binding posts to the other end of the 
connecting leads by correcting for the effect of the leads’ series imped- 
ance. This impedance is obtained by measuring the leads with a short 
circult in place of the component. Figures 12 through 15 apply to the 
admittance difference as seen either at Cozy’s binding posts or at the 
component’s end of the connecting leads. 
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For components measured with the small-impedance configurations, 
the uncertainty contours are for the impedance difference between a 
measurement of the component at the end of connecting leads having 
an inductance of 0.1 wH or larger and a measurement of a short-circuit 
at the end of the leads. This requirement for a minimum 0.1-~H 
inductance in the connecting leads arises because the condition used 
for the reference balance differs from that for the unknown balance by 
the closing of the switch, S1P in Fig. 4, across the binding posts. If the 
impedance connected to the binding posts is small compared to that of 
the switch, then the impedance difference between the unknown 
balance and the reference balance is insensitive to the impedance 
connected to the binding posts. Since the inductance of the switch is 
0.025 wH, using connecting leads having inductances of 0.1 »H or more 
provides good sensitivity for measuring the inductance of the leads 
with the component connected to them and the inductance of the 
leads with the short-circuit connected to them. In addition, with 0.1- 
utH leads, errors in the calibration of the impedance of the switch have 
approximately equal effects on the measurements of the leads with the 
component connected and of the leads with the short-circuit con- 
nected. As a result, errors in the calibration of the switch’s impedance 
have only slight effects on the impedance difference between the 
measurements. 

For components measured with the small-impedance configurations, 
the uncertainty contours also apply to the impedance difference after 
being translated to the component’s end of the connecting leads by 
correcting the above impedance difference for the connecting leads’ 
shunt admittance, which is obtained by measuring the leads open- 
circuited. 

Figures 12 through 15 were obtained from an analysis of the effects 
on measurements of the uncertainties in the individual quantities 
entering into the measurement results. The uncertainties for these 
quantities were arrived at by a combination of: (1) analysis of the 
uncertainties in the determinations of the individual quantities by the 
calibration processes; (2) intercomparisons of the measurements of the 
same unknown made in different ways (e.g., made with both the small- 
admittance and small-impedance bridge configurations); (3) measure- 
ments of some standards whose admittances are known by other 
means (e.g., specially constructed parallel plate capacitors and the 
inductance standards of Ref. 1); and (4) much use of the bridge for the 
past eight years. 


5.2 Sources of uncertainty in small-admittance configuration 
measurements 


The sources of uncertainty for measuring a component’s capacitance 
or inductance using the small-admittance configuration are in the 
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capacitance calibrations of the bridge standards and in the inductance 
calibration for the impedance, Zcy in Fig. 9, in series with the binding 
posts. The uncertainty in the calibration of the capacitance of the 
capacitance standard at frequencies below 100 kHz is +0.02 percent. 
This value represents the uncertainties due to aging and to transferring 
a calibration from the Type 12 capacitance bridge. The basic uncer- 
tainty of the Type 12 bridge is +0.005 percent. Checks of the cozy 
bridge show its variations with time to be less than +0.01 percent. At 
frequencies above 4 kHz the end term uncertainty in the capacitance 
calibration for large capacitances is +0.02 pF and results from: mutual 
capacitances between the standard’s decades; accumulation in the 
calibration values for the larger steps of the small uncertainties in each 
of the many balances that contribute to their calibrations; and there 
being several capacitance and conductance decades involved in any 
one balance setting. At frequencies below 4 kHz the end term uncer- 
tainty corresponds to a susceptance resolution of +0.5 nS. 

An uncertainty of +5 nH is assigned to the calibration of the 
parasitic series impedance, Zcy. However, it is not known how much 
of this comes from the uncertainty in the value for Zcy and how much 
comes from mutual inductances between the decades of the capaci- 
tance standard. 

The 0.02-pF and 5-nH uncertainties produce uncertainties in the 
measurements of the external standards used to determine absolute 
calibrations for the bridge’s capacitance standard. The minimum mea- 
surement uncertainty occurs when the individual contributions of 
these uncertainties are equal. At the optimum admittance level the 
resulting calibration uncertainty is +0.013M percent, where /M is the 
calibration frequency in megahertz. 

For measurements of unknowns having small @-values there is an 
additional capacitance uncertainty of +0.06M/Q-x percent, where Qx 
is the Q-value of the unknown. This uncertainty results from a 100-ps 
time constant uncertainty in the calibration of the phase angle of the 
bridge’s conductance standard. 

The major sources of uncertainty in the measurement of conduct- 
ance using the small-admittance configurations are: 

@ +0.02 percent and +0.002-uS uncertainties in the calibration of 

the conductance standard 

.@ +20 microradians and +5-ps time constant uncertainties in the 

calibration of the loss angle of the capacitance standard. Actually, 
part of the 5-ps uncertainty may be due to uncertainty in the time 
constant of the bridge ratio. These uncertainties are the major 
uncertainties in measuring large Q-values. 

@ +0.006M-uS conductance resolution (corresponding to +0.001 

pF), where M is the frequency in megahertz. 

@ +0.001 VM-ohm uncertainty in the resistance of Zcu. 
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5.3 Sources of uncertainty in small-impedance configuration 
measurements 


For measurements using the small-impedance configurations, the 
uncertainties in the calibrations of the capacitance and conductance 
standards produce corresponding uncertainties in the measured ad- 
mittance difference between the unknown and reference balances and 
in the calibration values for the series and shunt admittances, Yx and 
Yer in Fig. 10. In addition, the measurement of the admittance 
difference between the balances has an end term uncertainty of +2 
parts per million of the series admittance, Yx. 

The end terms for the impedance difference between an unknown 
and a short circuit at the end of 0.1-~H connecting leads are +[(0.2 to 
0.7) + 0.016/M]nH and +[(0.2 to 0.5) + 0.6M]mQ. The minimum 
values pertain to components having impedances small compared to 
the impedance of the connecting leads. The maximum values are for 
component impedances large compared to that of the connecting leads. 


VI. AUTOMATIC MAINTENANCE AIDS 


The overall system includes automatic aids for maintaining the 
calibration of the bridge standards and the operation of the hardware. 
Of particular importance are the aids for calibration maintenance. 
Errors in the calibration typically do not interfere with the measure- 
ment process to cause error messages, as hardware faults do, and so 
the only way to have continuing confidence in COZy’s accuracy is to 
check the calibration on a routine basis. However, calibration checks 
involve so many bridge balances and computations that to make them 
practical on a routine basis requires automation. 

The calibration maintenance aids include a group of programs that 
check for changes in the admittances of the steps of the capacitance 
and conductance standards. One program does a step-up calibration of 
the bridge standards for up to six successive times. The average values 
and the scatters for the capacitance and conductance of each step are 
printed out. Examination of these results shows whether the system is 
performing well and whether the reproducibilities of the steps are 
satisfactory. 

Another program performs an element by element subtraction be- 
tween the values of a new calibration and of the calibration being used. 
for measurements. Examination of the results shows whether the 
calibration being used is satisfactory. 

Still another program automatically measures the series and shunt 
admittances, Yx and Yer, of the small-impedance configurations. 
These results can be compared with the existing calibrations. 

The aids for maintaining the hardware include monitors at the 13 
power supplies and at 29 points within the generator and receiver. The 
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outputs of the monitors can be read by the A/D converter and are 
used by a hardware checking program to measure: the voltages of the 
power supplies; the gains and losses of the amplifiers and attenuators; 
the in-band transmissions and out-of-band rejections of the low pass 
filters; and various signal levels. The program can be run to check that 
the functions and levels are within limits or to print out the results of 
the measurements. A go/no-go check of the functions and levels takes 
five minutes. When the generator or receiver is malfunctioning, a 
diagnosis of the measurements can lead to localizing the fault to within 
four active components. 

A second hardware checking program tests the operation of the 
relays in the small-step decades of the bridge’s and step-up unit’s 
capacitance and conductance standards. The special mercury-wetted 
switches, which use platinum leads, are not as reliable as the standard 
switches, which use only magnetic alloy leads; the platinum-to-glass 
seal is fragile. As a result, some relays have failed. Typically these 
relays were in the bridge’s small-step decades, which contain the most 
used relays. A few of these failed relays gave troubles by becoming 
slower than the settling times used in the measuring program. In these 
cases, static tests of relay operation were not sufficient. 


Vil. MEASUREMENTS OF ENVIRONMENTAL COEFFICIENTS AND 
DISACCOMMODATION FACTORS 


7.1 Environmental coefficient measurements 


To make environmental coefficient measurements the user connects 
the components to sample boards in the environmental chamber, 
connects a coaxial cable from the sample boards to the bridge’s binding 
posts and enters via dialogue the desired measurement conditions, post 
processing of the measurement results and output media. Up to six 
components may be connected to each of three sample boards, with 
all the components on a board having the same nominal impedance. 
One signal level and up to 20 test frequencies may be specified for each 
board. Up to 20 environmental conditions, i.e., combinations of dry 
bulb temperature and relative humidity, may be specified. For each 
environmental condition there may also be specified a soak time, and 
whether the rate of temperature change is to be kept below 1° Celsius 
per minute (to avoid thermal shock). 

For each environmental condition the computer passes to the cham- 
ber’s microprocessor the dry bulb temperature, dew point, soak time, 
and whether the rate of temperature change is to be restricted. Then 
the computer repetitively queries the microcomputer concerning 
whether the specified conditions have been met. When the conditions 
have been met, cozy makes the specified impedance/admittance mea- 
surements and the computer then passes to the microcomputer the 
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next set of conditions. While the environmental conditions are being 
achieved, the computer and bridge unit are free for general purpose 
measurements. 

Each of the three sample boards in the environmental chamber has 
eight pairs of binding posts. Six of the eight pairs are for samples, the 
seventh pair has a short circuit across it and the eighth pair is left 
open. The short-circuited and open-circuited binding posts are used to 
obtain corrections for the effects of the series impedances and shunt 
admittances of the cables, switches and sample board circuitry between 
the bridge’s binding posts and the binding posts to which the samples 
are connected. 

The sample boards are made from teflon impregnated sheets of 
woven fiberglass. Within the chamber the upper side of each quarter- 
inch thick board is plated with 3-mil copper that serves as the ground 
plane for eight strip lines on the board’s under side. The strip lines 
provide the connections to the board’s “high” binding posts. The 
thermal conductivity from the binding posts to the outside of the 
chamber is low enough so that at steady state the temperatures of the 
binding posts are within 0.1° Celsius of the chamber’s temperature. 
The difference between the inductance and resistance of the strip line 
to a board’s short-circuited binding posts and the inductances and 
resistances of the strip lines to the board’s sample positions are 
corrected for in data reduction via stored values for these differences. 
Similarly, the capacitance differences between the open-circuited strip 
line and the strip lines to the sample positions are corrected for in data 
reduction. However, the capacitance differences vary nonreproducibly 
with temperature, humidity, and recent history. These nonreproduci- 
ble variations cannot be corrected for and therefore introduce errors 
in environmental coefficient measurements. The worst case variation 
over the environment range of the chamber is +0.2 pF. 

Increased speed for environmental coefficient measurements was 
achieved by changing the software to take advantage of the similarities 
of many of the measurements. All the samples on one board are 
required to be nominally the same. Also, the impedances of the samples 
do not change drastically with the changes in environmental condi- - 
tions. Consequently, the complete measuring process is used only for 
the first environmental condition and then only for the short-circuit, 
the open-circuit, and the first sample positions on each sample board. 
For the second through sixth sample positions only the unknown 
balances are made; the bridge configuration information and reference 
balance data are retained from the first sample position. For the 
second and following environmental conditions the bridge configura- 
tion information obtained during the first environmental condition is 
used to set up the bridge for balancing. As a result of the decreased 
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number of reference balances and determinations of approximate 
values for the unknowns, the average time for a measurement is less 
than 10 seconds, versus about 20 seconds for a general purpose mea- 


surement. 
Increased accuracy for environmental coefficient measurements was 


achieved by changing the software so that measurements of small 
changes in admittance will not involve changes in the settings of 
decades having relatively large steps. Otherwise, small fractional 
changes in the admittances of the large-step decades could cause large 
errors in the measurements of small changes in admittance. This 
avoidance of changes in the large-step decades is done by making two 
balances instead of one when the bridge can be balanced at more than 
one setting of the standards. In these cases one balance uses zeros in 
the setting and the other balance uses tens. Thus, if a balance could be 
made at a setting of 2-0-0.-0-0 (i.e., the 100-pF per step decade set to 
two; the 10-pF, to zero; the 1-pF, to zero; etc.), it would be. However, 
a second balance would also be made at a setting of 1-9-9.-9-10. (For 
simplicity, in this example it is assumed that the actual value of a 
setting equals its nominal value.) The setting of the standards and the 
admittance at balance would be saved for both balances. Then, if at 
the next temperature the setting at balance were higher, say 2-0-0.-0- 
9, the difference would be calculated with respect to the 2-0-0.-0-0 
setting. But if the setting were lower, say 1-9-9.-9-1, the difference 
would be computed with respect to the 1-9-9.-9-10 setting. 


7.2 Disaccommodation factor measurements 


Measurements of the disaccommodation factors of ferromagnetic 
materials are also made using the environmental chamber. The user 
specifies temperature, relative humidity, soak time, temperature rate 
restriction, measurement frequency, and whether measurements are 
wanted 100 and 1000 minutes after demagnetization. (Measurements 
1 and 10 minutes after demagnetization are always made.) In addition, 
for each sample board the user specifies peak demagnetization current 
and test signal level. 

In the measurements the temperature is held fixed at the user- 
specified value, the components are individually demagnetized, and 
their inductances are measured at the appropriate times after demag- 
netization. Demagnetization is done with a 60-Hz current that de- 
creases linearly to zero in 15 seconds. The software for these measure- 
ments makes use of the zero- and ten-settings to optimize the accuracy 
for small admittance changes. Also, the software includes a reorgani- 
zation of the major blocks of the measurement process to make the 
time sequencing of demagnetizations and measurements so efficient 
that the demagnetizations and one-minute measurements of all 18 
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samples are done before the first ten-minute measurement is to be 
made. 


Vill. SUMMARY 


The computer-operated impedance/admittance bridge (cozy) is an 
easy-to-use facility that provides fast, highly accurate measurements 
over a wide impedance range at frequencies between 200 Hz and 30 
MHz. The accuracy is comparable to that achieved with specially 
developed manual bridges, but the expertise, care, and time required 
of the user are a tenth to a hundredth of that required for manual 
bridges. Also, the expertise and effort required to check and maintain 
the calibrations are similarly less. This is especially true since Cozy’s 
admittance-frequency coverage exceeds that of five manual bridges. 

In addition to high accuracy, Cozy also provides high resolution for 
measuring small changes in impedance/admittance. This attribute has 
proved to be very valuable in studies of stability versus shock and 
vibration and also versus time, and has been capitalized on by the 
addition of hardware and software that provide automatic measure- 
ments of environmental coefficients and disaccommodation factors. 

The speed, accuracy, and resolution of cozy have been extensively 
used in the development and evaluation of materials, structures, and 
complete components for ferromagnetic inductors and transformers. 
In this work cozy has permitted measurements and evaluations that 
would have been otherwise impossible. 
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1980), p 76. 


Informational Aspects of Stochastic Control. H.S. Witsenhausen, in Analysis 
and Optimization of Stochastic Systems, edited by Jabobs et al., New York: Academic, 
1980, pp 272-84. 


On Intersections of Interval Graphs. H. S. Witsenhausen, Discrete Math, 31 
(August 1980), pp 211-6. 


Problem E2741. H.S. Witsenhausen, Am Math Mon, 87 (January 1980) (solution). 
[Refer to Am Math Mon, 85 (November 1978), p 765 (problem). ] 


Representation and Approximation of Noncooperative Sequential Games. W. 
Whitt, SIAM J Control Optim, 18 (January 1980), pp 33-48. 


Some Useful Functions for Functional Limit Theorems. W. Whitt, Math Op Res, 
5 (February 1980), pp 67-85. 


COMPUTING 


Planning for the Bell Operations Systems Network. J.J. Amoss, Proc 5th Int 
Conf Computer Commun (October 80), pp 559-63. 


What is an Online Search? D. T. Hawkins and C. P. Brown, Online, 4, No. 1 
(January 1980), pp 12-8. 


ENGINEERING 


Alpha-Particle-Induced Soft Errors and 64K Dynamic RAM Design Interac- 
tion. R. J. McPartland, J. T. Nelson, and W. R. Huber, Proc Int Reliability Physics 
Symp, 1980 (April 8-10, 1980), pp 261-7. 

Ambipolar Transport in Double Heterostructure Injection Lasers. P. J. An- 
thony and N. E. Schumaker, IEEE Electron Device Lett, EDL-1, No. 4 (April 1980), pp 
58-60. 


Analysis of Microsegregation in Crystals. L. O. Wilson, J Cryst Growth, 48 
(March 1980), pp 363-6. 


Constant-Amplitude Antenna Arrays with Beam Patterns Whose Lobes Have 
Equal Magnitudes. M. R. Schroeder, Arch Elektr Ubertragung, 34 (1980), pp 165-8. 


Digital Transmission of Commentary-Grade (7 KHz) Audio at 56 or 64 kb/s. J. 
D. Johnston and D. J. Goodman, IEEE Trans Commun, COM-28, No. 1 (January 1980), 
pp 136-8. 

Dissolution and Formation of Intermetallics in the Soldering Process. W. G. 
Bader, Proc Seminar Phys Metallurgy Metals Joining (1980), pp 257-68. 

Effect of Sidewalls on Wavenumber Selection in Rayleigh-Binard 
Convection. M.C. Cross, P. G. Daniels, P. C. Hohenberg, and E. D. Siggia, Phys Rev 
Lett, 45 (15 September 1980), pp 898-901. 

Electroless Gold Plating on II]-V Compound Crystals. L.A. D’Asaro, S. Naka- 
hara, and Y. Okinaka, J Electrochem Soc, 127, No. 9 (September 1980), pp 1935-40. 
Fiber Measurement Standards. A. H.Cherin and W. B. Gardner, Laser Focus, 16, 
No. 8 (August 1980), pp 60-5. 

The Free Volume Concept and Its Implications on Dilatation in Glassy Polymers 
Under Shear Stresses. T.T. Wang and S. Matsuoka, J Polym Sci Polym Lett Ed, 
18 (September 1980), pp 593-8. 

FT3-The Bell System’s Metropolitan Trunk Lightwave System. I. Jacobs and J. 
R. Stauffer, Sixth European Conference on Optical Communication (Pub. No. 190), New 
York: Institution of Electrical Engineers (1980), pp 431-4. 

Geometric Effects on the Gate-Controlled Capacitor. E. B. Slutsky and J. N. 
Zemel, IEEE Trans Electron Devices, ED-27 (September 1980), pp 1843-6. 
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Improving the Properties of Recycled Plastics—Successful Case Histories. G. 
H. Bebbington, Soc Plastics Engineers—Regional Tech Conf (SPE-RETEC) Preprints 
#5 (October 7, 1980). 


Increasing the Wettability of Polymeric Battery Separators to Nonaqueous 
Electrolytic Solutions by Cross-Linking with Active Species in a Glow Dis- 
charge. J.J. Auborn and H. Schonhorn, Proc Symp Lithium Batteries, Electrochem 
Soc (October 6-10, 1980). 


Inhomogeneous Thermal Degradation of Poly(vinylidene Fluoride) Crystal- 
lized From the Melt. A. J. Lovinger and D. J. Freed, Macromolecules, 13 (1980), pp 
989-94. 


Laboratory Testing of New Feature Generics. R. C. Eisele, Proceedings of the 
National Electronics Conference 1980, 34 (1 October 1980), pp 294-7. 


400A Linewidth Lithography on Thick Silicon Substrates. R. E. Howard, E. L. 
Hu, L. D. Jackel, P. Grabbe, and D. M. Tennant, Appl Phys Lett, 36 (April 1980), pp 
592-4. 


Low Cobalt CrCoFe and CrCoFe-X Permanent Magnet Alloys. M. L. Green, R. 
C. Sherwood, G. Y. Chin, J. H. Wernick, and J. Bernardini, [IEEE Trans Mag, MAG- 16, 
No. 5 (September 1980), pp 1053-5. 


A Low-High Junction Solar-Cell Model Developed for Use in Tandem Cell 
Analysis. R.I. McPartland and A. G. Sabris, Solid State Electronics, 23 (June 1980), 
pp 605-10. 


Low Leakage Current and Saturated Reverse Characteristic in Broad-Area 
Indium and Gallium and Arsenide and Phosphide Diodes. fF. Capasso, R. A. 
Logan, P. W. Foy, and S. Sumski, Electron Lett, 16, No. 7 (March 1980), pp 241-2. 


Miniature Packaged Crystal Oscillators. R. E. Paradysz, D. M. Embree, U. R. 
Saari, and R. J. McClure, Proceedings of the 34th Annual Symposium on Frequency 
Control 1980 (1980), pp 475-87. 


A New Family of Ferroelectric Materials With Composition A,BMO3F3. J. 
Ravez, G. Peravdeau, H. Arend, S. C. Abrahams, and P. Hagenmuller, Ferroelectrics, 26 
(March 1980), pp 767-9. 


An NMOS Analog Building Block for Telecommunications Applications. P. E. 
Fleischer, K. R. Laker, D. G. Marsh, J. P. Ballantyne, and A. A. Yiannoulos, IEEE 
Trans Circuits Systems, CAS-27 (June 1980), pp 552-9. 


A Note on the Effect of Incident Beam Convergence on Quantitative Electron 
Energy Loss Spectroscopy. D.C. Joy, D. M. Maher, and R. C. Farrow, Microbeam 
Analysis 1980, edited by D. B. Wittny, San Francisco, California: San Francisco Press, 
1980, pp 154-7. 


Nuclear Magnetic Resonance Analysis of the Microstructure of Poly(chloro- 
prene sulfone). R. E. Cais and G. J. Stuk, Macromolecules, 13 (1980), pp 415-26. 


Observations of Negative Resistance Associated With Superlinear Emission 
Characteristics of (Al,Ga)As Double-Heterostructure Lasers. P.J. Anthony, T. 
L. Paoli, and R. L. Hartman, IEEE J Quantum Electron, QE-16, No. 7 (July 1980), pp 
735-9. 


An Optimal Design of LSI CMOS Polycells. S. M. Kang, Proc 1980 IEEE Int 
Symp Circuits Systems, 3 (April 1980), pp 1008-10. 

Optimization of Concatenated Fiber Bandwidth Via Differential Mode 
Delay. M. J. Buckler, Tech Dig, Symp Optical Fiber Measurements, 1980 (October 
190), pp 59-62. 

Optimization of Optoacoustic Cell for Depth Profiling Studies of Semiconductor 
Surfaces. A.C.Tam and Y. H. Wong, Appl Phys Lett, 36, No. 5 (15 March 1980), pp 
471-3. 

PBS—Positive Electron Resist—Capabilities and Limitations. M. J. Bowden, 
R. F. W. Pease, L. D. Yau, J. Frackoviak, L. E. Thompson, J. G. Skinner, and J. P. 
Ballantyne, in H. Ahmed and W. C. Nixon, Eds., Microcircuit Engineering, Cambridge: 
Cambridge U.P., pp 239-54. 

Photon Correlation Spectroscopy Near the Glass Transition in Polymers. G. 
D. Patterson and J. R. Stevens, ACS Polym Preprints, 21, No. 2 (1980), pp 16-7. 
Photon Correlation Spectroscopy of Polystyrene Solutions. G.D. Patterson, J.- 
P. Jarry, and C. P. Lindsey, Macromolecules, 13 (1980), pp 668-70. 

Physical Level Protocols. H. V. Bertine, IEEE Trans Commun, COM-28, No. 4 
(April 1980), pp 435-14, 
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Pumping Hazardous Gases. D.B. Fraser, J. L. Vossen, D. M. Hoffman, H. L. Pinch, 
R. M. Brown, M. Baron, and L. F. Dahlstedt, Am Vac Soc Mono, Pumping Hazardous 
Gases (June 1980). 


Rapid Interferometric Examination of Glass for Index Inhomogeneity. A. D. 
White, Appl Optics, 78 (1 August 1979), p 2525. 


Real Time Simulation of a Multichannel Interframe Coder for Video Confer- 
encing. A. B. Larsen, E. F. Brown, and J. M. Santelle, 1980 Canadian Commun & 
Power Conf (October 1980), pp 485-8. 


Signal and Noise Response of a Spectrum Expander. H. E. Rowe, IEEE Trans 
Circuits Systems, CAS-27 (September 1980), pp 804-15. 


Science and Technology in Lightwave Telecommunications. 5S. E. Miller, Tele- 
commun J, 47 (June 1980), pp 375-8. 


Single Crystal AgBr Infrared Optical Fibers. T. J. Bridges, J. S. Hasiak, and A. 
R. Strnad, Optics Lett, 5 (March 1980), pp 85-6. 


Solid Phase Solder Bonding for Use in the Assembly of Microelectronic Cir- 
cuits. R.H. Minetti, Proc 1980 Int Microelectron Symp, pp 126-9. 


Solution to a Solderability Problem by Combining Metallography and Thermal 
Analysis. J. E. Bennett, T. M. Paskowski, and H. E. Bair, in Proceedings of the 
Tenth NATA Conference, Boston, Massachusetts: North American Thermal Analysis 
Soc, 1980, pp 185-91. 


Superconductivity and Resistivity of Amorphous Niobium Germanium AIl- 
loys. S. Kosinki, Phys Lett, 764, No. 2 (March 80), pp 160-7. 


Techniques of Ultra Fine Pattern Generation. R.E. Howard, Solid State Technol, 
23 (August 1980), pp 127-32. 


Temperature Dependence of the Losing Threshold Current of Double Heter- 
ostructure. P.J. Anthony and N. E. Schumaker, J Appl Phys, 51, No. 9 (September 
1980), pp 5038-40. 


Textured Thin-Film Si Solar Selective Absorbers Using Reactive Ion 
Etching. H. G. Craighead, R. E. Howard, and D. M. Tennant, Appl Phys Lett, 37 (1 
October 1980), pp 653-5. 


Thermal Shearing Effects on the Temperature Stability of Saw Devices. R. L. 
Rosenberg, IEEE Trans Sonics & Ultrasonics, SU-27 (May 1980), pp 130-3. 


Thermocompression Bondability of Thick Film Gold, A Comparison to Thin 
Film Gold. N.T. Panousis and R. C. Kershner, Proc 1980 Electron Components Conf 
(April 1980), pp 472-7. 


On the Thickness and Spatial Distribution of a Fluoropolymer Film Deposited 
by Solution Dipping. H. G. Tompkins and S. P. Shorma, J Apply Polym Sci, 25 
(February 1980), pp 211-222. 


Transport Properties of sn-doped Al,Ga).,.As Grown by Molecular Beam Epi- 
taxy. H. Morkog, A. Y. Cho, and C. Radice, Jr., J Appl Phys, 51, No. 9 (September 
1980), pp 4882-4. 


Trap-Controlled Transient Photoconductivity in Dielectrics. KR. J. Fleming, J 
Appl Phys, 50 (1979), pp 8075-81. 

Two-Dimensional Interaction of Ion-Acoustic Solitons. P. A. Folkes, H. Ikezi, 
and R. Davis, Phys Rev Lett, 71 (15 September 1980), pp 902-4. 

Ultrasmall Superconducting Josephson Junction. E. L. Hu, R. E. Howard, L. D. 


Jackel, L. A. Fetter, and D. M. Tennant, IEEE Trans Electron Devices, ED-27 (October 
1980), pp 2030-1. 


Waveguide Propagation in Frozen Gas Matrices. R. Rossetti and L. E. Brus, Rev 
Sci Instrum, 51 (1980), pp 467-70. 


SOCIAL AND LIFE SCIENCES 


Acoustic Inverse Scattering as a Means for Determining the Area Function of 
a Lossy Vocal Tract: Theoretical and Experimental Model Studies. J. R. 
Resnick, J Acoust Soc Am, 67 (February 1980), p 722. 


Acoustics in Human Co. munications: Room Acoustics, Music, and 
Speech. M.R. Schroeder, J Acoust Soc Am, 68, No. 1 (July 1980), pp 22-8. 
Application of Dynamic Time Warping to Connected Digit Recognition. L. R. 
Rabiner and C. E. Schmidt, IEEE Trans Acoustics, Speech, and Signal Processing, 
ASSP-28 (August 1980), pp 377-88. 
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Application of Isolated Word Recognition to a Voice Controlled Repertory 
Dialer System. L. R. Rabiner, J. G. Wilpon, and A. E. Rosenberg, Proc 1980 Int Conf 
Acoustics, Speech, and Signal Processing (April 1980), pp 182-5. 

Cochlear Macromechanics—Time Domain Solutions. J. B. Allen and M. M. 
Sondhi, J Acoust Soc Am, 66 (July 1979), pp 123-32. 

Cochlear Models—1978. J.B. Allen, in Workshop on Models of the Auditory System 
and Related Signal Processing Techniques, Stockholm: Almquist and Wiksell, 1979, 
pp 1-16. 

Computer Studies on Parametric Coding of Speech Spectra. J. L. Flanagan and 
S. W. Christensen, J Acoust Soc Am, 68, No. 2 (August 1980), pp 420-30. 

A Connected Digit Recognizer Based on Dynamic Time Warping and Isolated 
Digit Templates. L. R. Rabiner and C. E. Schmidt, Proc 1980 Int Conf Acoustics, 
Speech, and Signal Processing (April 1980), pp 194-8. 

Considerations in Applying Clustering Techniques to Speaker Independent 
Word Recognition. L. R. Rabiner and J. G. Wilpon, Proc IEEE Int Conf Acoustics, 
Speech, and Signal Processing (April 1979), pp 578-81. 


Experimental Studies in a New Automatic Speaker Verification System Using 
Telephone Speech. S. Furui and A. E. Rosenberg, Proc 1980 IEEE Int Conf Acous- 
tics, Speech, and Signal Processing, 3 (April 1980), pp 1060-2. 

Experiments with New Telecommunications Service Capabilities. R. E. Card- 
well, ISSSL 80 Proc, IEEE Cat # 80 CH1565-1 (September 1980), pp 187-91. 


A Model for Advanced Reservations for Large Scale Conferencing 
Services. H. Luss, J Op Res Soc, 31 (March 1980), pp 239-45. 


On the Implementation of a Short-Time Spectral Analysis Method for System 
Identification. L. R. Rabiner and J. B. Allen, IEEE Trans Acoustics, Speech, and 
Signal Processing, ASSP-28 (February 1980), pp 69-78. 


Invertibility of a Room Impulse Response. 5S. T. Neely and J. B. Allen, J Acoust 
Soc Am, 66 (July 1979), pp 165-9. 

An Overview of Speech Recognition Research at Bell Laboratories. S. E. 
Levinson, Second Annual IHKEE Eng in Medicine and Biology Conf Proc (September 
1980), pp 87-91. 

Parametric Coding of Speech Spectra. J. L. Flanagan, J Acoust Soc Am, 68, No. 
2 (August 1980), pp 412-9. 


Procedures for Conducting a Successful Telemeeting. C. Stockbridge and T. B. 
Bateman, Ninth Int Symp Human Factors in Telecommun (October 1980), pp 199-204. 


Signal Models for Low Bit-Rate Coding of Speech. J. L. Flanagan, K. Ishizaka, 
and K. L. Shipley, J Acoust Soc Am, 68, No. 3 (September 1980), pp 780-91. 

Speaker Independent, Isolated Word Recognition for a Moderate Size (54 Word) 
Vocabulary. L. R. Rabiner and J. G. Wilpon, IEREE Trans Acoustics, Speech, and 
Signal Processing, ASSP-27 (December 1979), pp 583-7. 


Speaker Independent Recognition of Isolated Words Using Clustering Tech- 
niques. L.R. Rabiner, 8S. E. Levinson, A. E. Rosenberg, and J. G. Wilpon, IEEE Trans 
Acoustics, Speech, and Signal Processing, ASSP-27 (August 1979), pp 336-49. 
Statistical Properties of the Log Likelihood Ratio for LPC. J. M. Tribolet, L. R. 
Rabiner, and M. M. Sondhi, IEEE Trans Acoustics, Speech, and Signal Processing, 
ASSP-27 (October 1979), pp 550-555. 


Technique for Frequency Division/Multiplication of Speech Signals. J. L. 
Flanagan and S. W. Christensen, J Acoust Soc Am, 68, No. 4 (October 1980), pp 1061-8. 
Techniques for Expanding the Capabilities of Practical Speech Recognizers. J. 
L. Flanagan, S. E. Levinson, L. R. Rabiner, and A. E. Rosenberg, in Wayne Lea, Ed., 
Trends in Speech Recognition, Englewood Cliffs, NJ: Prentice-Hall, 1980, pp 425-44. 


MANAGEMENT AND ECONOMICS 


An Extended Application of the Delta(2) Measure of Predictor-Variable Impor- 
tance. LP. EK. Green, J. D. Carroll, and W. S. De Sarbo, Proc Am Market Assn 
Education Conf, 1979 (1979), pp 1-4. 

SPC Network Planning in the Bell System. S. Horing, J. J. Lawser, and R. L. 
Simms, Jr., Proc Int Symp Telecommun Networks Plan (October 1980), pp 307-11. 
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